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HUMAN TRANSPORT PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human transport proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of transport, metabolic 
neurological, cardiovascular, reproductive, and immune disorders, and cell proliferative disorders 
including cancer. 
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BACKGROUND OF THE INVENTION 
Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes. These membranes act as a barrier to most molecules, and 
mamtain the essential differences between the cytosol, the extracellular environment, and the contents 
of each mtracellular organelle. Transport of essential nutrients, certain metal ions, metabolic waste 
15 products, cell signaling molecules, macromolecules, and proteins across lipid membranes and 
between organelles must be mediated by a variety of transport molecules. Transport between the 
cytoplasm and the extracellular environment, and between the cytoplasm and lumenal spaces of 
cellular organelles requires specific transport proteins. Each transport protein carries a particular 
class of molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular 
20 species of the class. 

Cells and organelles require transport molecules to import and export essential nutrients and 
metal ,ons including K\ NH/, Pjl SO/ , sugars, and vitamins, as well as various metabolic waste 
products. Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, 
synaptic neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse 
25 cell functions (Griffith, J. and C. Sansom (1998) The Transnnrtpr Parts RnnV Academic San 
Diego CA. pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be hnked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 
change that transfers the bound solute across the membrane, and channel proteins, which form 
30 hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Transport proteins are often multi-pass transmembrane proteins, which either actively 
transport molecules across the membrane or passively allow them to cross. Active transport involves 
d.rect,onal pumping of a solute across the membrane, usually against an electrochemical gradient 
35 Acnve transport is tightly coupled to a source of metabolic energy, such as ATP hydrolysis or an 
electrochemically favorable ion gradient. Passive transport involves the movement of a solute down 
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its electrochemical gradient. Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a 
specific solute to be transported and undergo a conformational change which transfers the bound 
solute across the membrane. Channel proteins, which only function in passive transport, form 
5 hydrophilic pores across the membrane. When the pores open, specific solutes, such as inorganic 
ions, pass through the membrane and down the electrochemical gradient of the solute. Examples 
mclude facilitative transporters, the secondary active symporters and antiporters driven by ion 
gradients, and active ATP binding cassette transporters involved in multiple-drug resistance and 
targeting of antigenic peptides to MHC Class I molecules. Transported substrates range from 
10 nutrients and ions to a broad variety of drugs, peptides and proteins. 

Information on the action of ARL-6 (ADP-ribosylation like factor), an endoplasmic reticulum 
transmembrane protein, can be found in Greenfield. J.J. and S. High (1999; J. Cell Sci. 1 12:1477- 
1486). Information on reduced folate carrier transporter proteins can be found in Dixon. K.H. et al 
(1994; J. Biol. Chem. 269: 17-20) and Moscow, J.A. et al. (1995; Cancer Res. 55:5983-5987). 
15 Carrier proteins which transport a single solute from one side of the membrane to the other 

are called uniporters. In contrast, coupled transporters link the transfer of one solute with 
simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (an.iport). For example, intestinal and kidney epithelia contain a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodrum gradient that provides the driving force for solute uptake i s maintained by the ubiquitous 
NaTK* ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1 ) 
iodide transporter (NIS). and multivitamin transporter (SMVT). All three transporters have twelve ' 
putanve transmembrane segments, extracellular glycosylate sites, and cytoplasmically-oriented N- 
25 and C-termini. 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the brown fat 
uncoupling protein which transports protons from the cytosol into the matrix; the pyruvate carrier the 

30 dtcarboxylate carrier which transports malate, succinate, fumarate. and phosphate; the tricarboxylate 
earner which transports citrate and malate; and the Grave's disease carrier protein, a protein 
recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism (Stryer, L. (1995) BJpch^nustry, W.H. Freeman and Company, New York NY p 
551; PROSITE PDOC00189 Mitochondrial energy transfer protems signature; Online Mendelian 

35 Inheritance in Man (OMIM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
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proton leaks across the inner mitochondrial membrane, thus uncoupling xidati ve phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier. D. et al. 
(1999) J. Int. Med. 245:637-642). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
zinc is required for the function of enzymes such as the extracellular matrix metalloproteinases, and 
zinc ions stabilize several motifs commonly found in transcription factors, including zinc fingers, zinc 
clusters, and UM domains. Zinc and other metal ions must be provided in the diet, and are absorbed 
by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and 
other target organs, where specific transporters move the ions into cells and cellular organelles as 
needed. Imbalances in metal ion metabolism have been associated with a number of disease states 
(Danks, D.M. (1986) J. Med. Genet. 23:99-106). 
15 The largest and most diverse family of transport proteins known are the ATP-binding cassette 

(ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 
chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the 
case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
homo- and heterodimers, such as Tapl and Ta P 2. the endoplasmic reticulum-based major 
25 histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects 
in ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70. PMP70), and hyperinsulinemic hypoglycemia 
(sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 
30 ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used 
in chemotherapy (Taglicht, D. and S. Michaelis (1998) Methods Enzymol. 292:131-163). 

The nuclear pore complex (NPC) is a large multiprotein complex spanning the nuclear 
envelope which mediates the transport of proteins and RNA molecules between the nucleus and the 
cytoplasm, thus contributing to the regulation of gene expression. The NPC allows passive diffusion 
35 of ions, small molecules, and macromolecules under about 60kD, while larger macromolecules are 
transported by facilitated, energy-dependent pathways. Nuclear localization signals (NLS), consisting 
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of short stretches of amino acids enriched in basic residues, are found on proteins that are targeted to 
the nucleus, such as the glucocorticoid receptor. The NLS is recognized by the NLS receptor, 
.mportin, which then interacts with the monomeric GTP-binding protein Ran. This NLS 
protein/receptor/Ran complex navigates the nuclear pore with the help of the homodimeric protein 
5 n ^transportfactor2(N^^ «, 42 0- 
429; Gorlich, D. (1997) Curr. Opin. Cell Biol. 9:412-419). Four O-linked glycoproteins, p62, P 58, 
P54, and p45, exist as a stable "p62 complex" that forms a ring localized on both nucleoplasm* and 
cytoplasmic surfaces of the NPC. The P 62. p58, and P 54 proteins all interact directly with the 
cytosolic transport factors P 97 and NTF2. suggesting «ha, the p62 complex is an important ligand 
10 bmdmg s,te near the central gated channel of the NPC (Hu, T. et al. (1996) J. Cell Biol. 134:589-601). 

Transport can also occur through intercellular bridges which connect the cytoplasms of sister 
cells, for example in the male and female germline of species ranging from fruit flies to humans. 
These bridges allow passage of cytoplasmic materials between cells during development. 
Intercellular bridges have also been found to connect somatic cells. The nurse cells and oocyte of a 
15 Drosophila egg chamber, which are derived from a single precursor cell through four rounds of 

rmtosis, are connected to each other through intercellular bridges called ring canals. The cells do not 
completely separate after mitosis; the mitotic cleavage furrows are transformed into ring canals by the 
addrfon of an actin cytoskeleton lining the tunnels between the cells. The Drosonhi., ke , ch prot ein 
functions ,n organizing actin in the ring canal. Mutations in kelch cause female sterility in 
20 Drosophila . Kelch contains four protein domains: the NTR domain at the N-terminus, the BTB or 
POZ domain, the IVR or intervening region; and the kelch repeat domain, which contains six 50- 
amino acid kelch repeats. The BTB or POZ domain, a 120-amino acid motif that is also found in 
several zinc-finger containing transcription factors, may be important in dimerization of kelch. Kelch 
repeats are found in other proteins as well and may be important for actin binding (Robinson D N 
25 and L. Cooley (1997) J. Cell Biol. 138:799-810; Cooley, L. (1998) Cell 93:913-915). 

Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
.ons across the plasma membrane. The movement of ions requires ion channels, which form an ion- 

30 selects pore within the membrane. Ion channels share common structural and mechanistic themes. 
The channel consists of four or five subunits or protein monomers that are arranged like a barrel in 
the plasma membrane. Each subunit typically consists of six potential transmembrane segments (SI, 
S2,S3.S4,S5,andS6). The center of the barrel forms a pore lined by a-helices or p-strands. The 
side chains of the amino acid residues comprising the a-helices or B-stmnds establish the charge 

35 (canon or anion) selectivity of the channel. The degree of selectivity, or what specific ions are 

allowed ,o pass through the channel, depends on the diameter of the narrowest part of the pore There 
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are two basic types of ion channels, ion transporters and gated ion channels. Ion transporters utilize 
the energy obtained from ATP hydrolysis to actively transport an i n against the ion's concentration 
gradient. Gated ion channels allow passive flow of an ion down the ion's electrochemical gradient 
under restricted conditions. Together, these types of ion channels generate, maintain, and utilize an 
electrochemical gradient that is used in 1) electrical impulse conduction down the axon of a nerve 
cell, 2) transport of molecules into cells against concentration gradients, 3) initiation of muscle 
contraction, and 4) endocrine cell secretion. 

Transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including NaMC* ATPase. Ca" ATPase, H* ATPase, and Cu~ ATPase, are activated by 
a phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of W and Ca" are low and cytosolic concentration 
of K* is high. The vacuolar (V) class of ion transporters include H* pumps on intracellular 
organelles, such as lysosomes and Golgi. V^lass ion transporters are responsible for generating the 
low P H within the lumen of these organelles that is required for function. The coupling factor (F) 
15 class consists of H* pumps in the mitochondria. F-class ion transporters utilize a proton gradient to 
generate ATP from ADP and inorganic phosphate (P ( ). 

Cu" ATPases export copper from cells (PROSITE PDOC00139 E1-E2 ATPases 
Phosphorylation site). Mutations in one Cu~ ATPase cause Wilson disease, in which toxic amounts 
of copper accumulate in a number of organs, particularly the liver and brain (Tanzi, R.E. et al. (1993) 
Nat. Genet. 5:344-350). Mutations in another Cu~ ATPase cause Menkes disease and occipital horn 
syndrome. Menkes disease mutations block export of copper from the gastrointestinal tract, leading 
to skeletal abnormalities, severe mental retardation, neurologic degeneration, and mortality in early 
childhood (Harrison, M.D. and C.T. Dameron (1999) J. Biochem. Mol. Toxicol. 13:93-106). 
Occipital hom syndrome mutations cause connective tissue defects (Harrison, supra; Levinson, B. et 
25 al. (1996) Hum. Mol. Genet. 5:1737-1742). 

The coupling factor (F) class of ion transporters consists of H* pumps in mitochondria, 
chloroplasts, and bacteria. For example, the F^. ATPase utilizes a proton gradient across the inner 
mitochondrial membrane to generate ATP from ADP and inorganic phosphate (PJ. The FJF, ATPase 
is composed of the F 0 complex, which is the transmembrane channel through which protons flow, and 
30 the F, complex, where ATP synthesis activity resides. F 0 has three subunits. A (also known as protein 
6), B, and C (Lodish, H. et al. ( 1995) Molecular Cell Biolopy, Scientific American Books, New York 
NY, pp. 752-756; PROSITE PDOC00420 ATP synthase a subunit signature). 

Voltage-gated Ca 2 * channels are involved in presynaptic neurotransmitter release, and heart 
and skeletal muscle contraction. The voltage-gated Ca - channels from skeletal muscle (L-type) and 
35 brain (N-type) have been purified and, though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The a, subunit forms the 
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membrane pore and voiiage sens r, while the 0,5 and 0 subunits modulate the voltage-dependence, 
gating properties, and the current amplitude of the channel. These subunits are encoded by at least 
six a„ one a^, and four p> genes. A fourth subunit, y, has been identified in skeletal muscle (Walker, 
D. et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:490-492). The 
5 human 04 subunit is homologous to the mouse epilepsy gene lethargic, and is a candidate for 

involvement in neurological disorders including ataxia and absence epilepsy (Escayg, A. et al. (1998) 
Genomics 50: 14-22). 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds 
to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter 
10 binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or 
muscle cells. There are two types of neurotransminer-gated channels. Sodium channels open in 
response to excitatory neurotransmitters, such as acetylcholine, glutamate. and serotonin. This 
opening causes an influx of Na* and produces the initial localized depolarization that activates the 
voltage-gated channels and starts the action potential. Chloride channels open in response to 
15 inhibitory neurotransmitters, such as y-aminobutyric acid (GABA) and glycine, leading to 
hyperpolarization of the membrane and the subsequent generation of an action potential. 

Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, 
are also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and brain. Calcium-activated 
20 K* channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for 

shunting excess K* from brain capillary endothelial cells into the blood. They are also implicated in 
repolarizing granulocytes after agonist-stimulated depolarization (Ishi, T.M. et al. (1997) Proc. Natl. 
Acad. Sci. USA 94:1 1651-1 1656). Another transmembrane protein, the leukotrine B4 receptor (BLT) 
appears to be involved in inflammation responses and host cell defense against infection. BLT also 
25 functions as an HTV coreceptor (Izumi, T. et al. (1997) Nature 387:620-624; Martin, V. et al. (1999) 
J. Biol. Chem. 274:8597-8603). 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted intherapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
30 and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
K* channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca~ and cAMP. In non-excitable tissue, K* channels 
are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
35 potentials and repolarizing membranes, K* channels are responsible for setting resting membrane 
potential. The cytosol contains non-diffusible anions and. to balance this net negative charge, the cell 



6 



10 



WO 00/78953 PCT/USOO/16668 

contains a Na + -K* pump and ion channels that provide the redistribution of Na\ K + , and CI". The 
pump actively transports N«- out f the cell and K* into the cell in a 3:2 ratio. I n channels in the 
plasma membrane allow K* and CI to flow by passive diffusion. Because of the high negative charge 
within the cytosol, CI " flows out of the cell. The flow of K* is balanced by an electromotive force 
pulling K* into the cell, and a K* concentration gradient pushing K* out of the cell. Thus, the resting 
membrane potential is primarily regulated by K'flow (Salkoff. L. and T. Jegla (1995) Neuron 15:489- 
492). Information on NY-REN-45, a K + channel integral membrane protein, can be found in 
Scanlan, M.J. et al. (.998; Int. J. Cancer 76:652-658). The emopamil-binding protein (EBP) shares 
structural features with both pro- and eukaryotic drug transport proteins (Hanner, M. et al. (1995) J. 
Biol. Chem. 270:7551-7557). The Na + channel, transmembrane protein myelin protein zero (MPZ) 
may be responsible for some sporadic cases of Dejerine-Scottas disease (hereditary motor and 
sensory neuropathy type 111) (Hayasaka, K. et al. (1993) Nat. Genet. 5:266-268). 

K + pore-forming subunits generally have six transmembrane-spanning domains with a short 
region between the fifth and sixth transmembrane regions that senses membrane potential; and the 
15 amino and carboxy termini are located intracellularly. In mammalian heart, the duration of 
ventricular action potential is controlled by a K* current. Thus, the K* channel is central to the 
control of heart rate and rhythm. K* channel dysfunctions are associated with a number of renal 
diseases including hypertension, hypokalemia, and the associated Banter's syndrome and Getelman's 
syndrome, as well as neurological disorders including epilepsy. K* channels have been implicated in 
Alzheimer's disease by observations that a significant component of senile plaques, beta amyloid or 
A beta, also blocks voltage-gated potassium channels in hippocampal neurons (Antes, L.M. et al. 
(1998) Seminar Nephrol. 18:31-45; Stoffel, M. and L.Y. Jan (1998) Nat. Genet. 18:6-8; Madeja, M. 
et al. (1997) Eur. J. Neurosci. 9:390-395; Good, T.A. et al. (1996) Biophys. J. 70:296-304). 

Gated ion channels control ion flow by regulating the opening and closing of pores. These 
channels are categorized according to the manner of regulating the gating function. Mechanically- 
gated channels open pores in response to mechanical stress, voltage-gated channels open pores in 
response to changes in membrane potential, and ligand-gated channels open pores in the presence of a 
specific ion, nucleotide, or neurotransmitter. 

Voltage-gated Na* channels are responsible for electrical excitability of neurons, skeletal 
muscle, heart, and neuroendocrine tissues. For example, the sequential opening and closing of 
voltage-gated Na* channels results in the propagation of action potentials down neuronal axons. Na* 
channels isolated from rat brain tissue are heterotrimeric complexes composed of a 260 kDa pore 
forming a subunit that associates with two smaller auxiliary subunits, pi and P2. The P2 subunit is 
an integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 
35 and pi subunits correlates with increased function of the channel, a change in the channel's gating 
properties, as well as an increase in whole cell capacitance (Isom. L.L. et al. (1995) Cell 83:433-442). 
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Integral Membrane Pmtpins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 
are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
5 typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a-helical 
conformation. TM proteins are classified as bitopic (Types I and H) and polytopic (Types m and IV) 
(Singer. S J. (1990) Annu. Rev. Cell Biol. 6:247-96). Bitopic proteins span the membrane once while 
polytopic proteins contain multiple membrane-spanning segments. TM proteins that act as cell- 
surface receptor proteins involved in signal transduction include growth and differentiation factor 
10 receptors, and receptor-interacting proteins such as Drosonhila pecanex and frizzled proteins, LIV-1 
protein, NF2 protein, and GNS1/SUR4 eukaryotic integral membrane proteins. TM proteins also act 
as transporters of ions or metabolites, such as gap junction channels (connexins) and ion channels, 
and as cell anchoring proteins, such as lectins, integrins, and fibronectins. TM proteins act as vesicle 
organelle-forming molecules, such as calveolins, or as cell recognition molecules, such as cluster of 
15 differentiation (CD) antigens, glycoproteins, and mucins. Information on connexin can be found in 
Kanter, H.L. et al. (1994; J. Mol. Cell. Cardiol. 26:861-868). 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, ROD, NGR, and 
GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. ROD, 
20 NGR, and GSL motif-containing peptides have been used as drug delivery agents in cancer treatments 
which target tumor vasculature (Arap, W. et al. (1998) Science, 279:377-380.) Furthermore, MPs 
may also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), 
also known as the C-type lectin domain, that mediate interactions with extracellular or intracellular 
molecules. 

25 G-protein coupled receptors (GPCR) comprise a superfamily of integral membrane proteins 

which transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators 
of inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
N-terminus, and a cytoplasmic C-terminus. Three extracellular loops alternate with three intracellular 

30 loops to link the seven transmembrane regions. The most conserved parts of these proteins are the 
transmembrane regions and the first two cytoplasmic loops. Cysteine disulfide bridges connect the 
second and third extracellular loops. A conserved. acidic-Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, 
35 S. and S. Arkinstall (1994) The G-protein Linked tonmrEri. *n* Academic Press, San Diego 
CA, pp 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 
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associated with neurological disorders such as schizophrenia, Parkinson's disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 

Cytochromes are electron-transferring proteins that contain a heme prosthetic group a 
porphyrin ring containing a tightly bound iron atom. Cytochromes act as oxidoreductases in such 
5 Averse cellular processes as respiration, photosynthesis, fatty acid metabolism, and neurotransmitter 
b.osymhes.s. The heme iron atom serves as the actual electron carrier by changing from the ferric to 
the ferrous oxidation state when accepting an electron. Cytochromes accept electrons from one 
substrate such as NADH or ascorbate and donate them to other electron carriers such as other 
cytochromes, ubiquinone, or semidehydroascorbic acid (Lodish. H. etal. (1 995) Molecu.amn. 
«0 Bjology, Scientific American Books. New York NY, pp. 759-770. 786-797; Sperling. P. et al (1995) 
Eur. J. B.ochem. 232:798-805; and Online Mendelian Inheritance in Man (OMIM) *600019 
Cytochrome b561, CYB561). 

Cytochrome b5 is an electron donor in membrane-linked redox enzyme systems involved in 
hp* and drug metabolism. Cytochrome b5 has been found in Golgi. piasma, outer mitochondrial 
15 endoplasmic reticulum (ER). and microbody membranes. Conserved amino acids in cytochrome b5 
■nclude e.ght invariant amino acids at W34, H51, P52, G53, G54, G63, F70. and H74, and fifteen 
conserved amino acids at L24. D5. S36. V41, Y42. N43, T45. W47, A48, L58, D65, T67, L85 T87 
and G88 (numbering based on the sunflower cytochrome b5/del,a-6 desaturase fusion protein; GI 
1040729. Sperling, supra). The invariant residues H51PGG are involved in heme-binding 
20 Cytochrome b5-Iike domains have also been found linked to other enzymes. For example 
cytochrome b5-lik e domains are part of del,a-9 fatty acid desaturases in yeast and Histoid 
car^ularum, nitrate reductase, sulfite reductase, flavocytochrome b2. Ar a bido, si „h alia n a acyi Iipid 
desaturase. and Bora^o officinalis (borage) and Helianthus annnus (sunflower) delta-6 desaturases 
(Sperlmg, supra; Sayanova, O. et al (1997) Proc. Natl. Acad. Sci. USA 94:421 1-4216; and Mitchell 
25 A.G. and C.E. Martin (1997) J. Biol. Chem. 272:2828 1 -28288). 

Signal peptides are found on proteins that are targeted to the endoplasmic reticulum (ER) 
S.gnal peptides consist of stretches of amino acids enriched in hydrophobic residues. Signal peptides 
are usually found at the extreme N-terminus of the protein and are recognized by a cytoso.ic signal- 
recognition peptide (SRP). The SRP binds to the signal peptide and to an SRP receptor, an integral 
30 membrane protein in the ER. Once bound to the SRP receptor, the newly formed protein containing 
the s.gnal peptide is translocated across the ER membrane. Proteins containing signal peptides may 
end up mserted into the lipid bilayer, or they may end up in the lumen of an organelle or secreted from 
the cell. 

35 Disease CnrrHatirm 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
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transpon of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ,on channels are associated with severa. disorders, e.g. cystic fibrosis, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
melhtus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes mclude, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (van* Hoff 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente. G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon. M. et al. (1995) New Engl. J. Med. 332: 1475-1480). 

Cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-linked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 
urine leads to the formation of cystine stones in the kidneys. 

Transthyretin (TTR), present in human plasma, binds to and transports the thyroid hormone 
thyroxine. Mutations in TTR result in the conversion of TTR to amyloid, an insoluble fibrillar 
structure. The resulting amyloid plaques have been shown to be the causative agent in the 
development of familial amyloid polyneuropathy and senile systemic amyloidosis (Miroy, G.J. e, al. 

(1996) Proc. Natl. Acad. Sci. USA 93:15051-15056). 

Stomatin, a 31-kDa erythrocyte integral membrane protein has been linked to the hereditary 
anemia stomatocytosis. This anemia is characterized by red blood cells that lack stomatin and leak 
Na + and K + . Thus, stomatin is presumed to play a role in the regulation of ion transpon. Red blood 
cell ,on transport defects are also linked to other disorders such as hypertension (Stewart, G.W. 

(1997) Int. J. Biochem. Cell Biol. 29:271-274). 
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The discovery of new human transport proteins and the polynucleotides encoding l 
satisfies a need in the art by providing new compositions which are useful in the diagnosis 
prevention, and treatment of transport, metabolic, neurological, cardiovascular, reproductive, and 
immune disorders, and cell proliferative disorders including < 



; cancer. 



SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human transport proteins, referred to 
collectively as "TPPT" and individually as "TPPT-1," "TPPT-2," "TPPT-3 " "TPPT-4 " "TPPT 5 " 

•TPPT-14," "TPPT-1 5," "TPPT-16," "TPPT-17," "TPPT-18," 'TPPT-19," "TPPT-20 " "TPPT-21 " 
•TPPT-22," "TPFT-23," "TPFT-24 - "TPPT-25." 'TPPT-26," -TPPT-27," 'TPPT-28,' ' "TPPT-29" 
"TPFr-30," TPPT-3, - -rm-32.- "TPPT-33." "TPPT-34," "TPPT-35," TPPT-36," ^-37" 
•TPPT-38," .TPPT-39," "TPPT-40," "TPPT-4 1 - -TPPT-42," and 'TPPT-43." In one aspect, the ' 
35 inventron provides an isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
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43, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
ac.d sequence selected from the group consisting of SEQ ID NO: 1 -43. c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. and d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 
5 NO:M3. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1 -43. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising 
an ammo acid sequence selected from the group consisting of a) an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-43, b) a naturally occurring amino acid sequence having at 
.0 least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-43. c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-43. and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-43. In one alternative, the polynucleotide encodes a 
polypepude selected from the group consisting of SEQ ID NO:l-43. In another alternative, the 
15 polynucleotide is selected from the group consisting of SEQ ID NO:44-86. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
cons.sting of SEQ ID NO :1 -43, b) a naturally occurring amino acid sequence having at leas, 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:M3 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of ' 
SEQ ID NO: 1-43. and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. In one alternative, the invention provides a cell transformed 
wtth the recombinant polynucleotide. In another alternative, the invention provides a transgenic 
25 organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO- 1-43 
30 c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1 -43, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. The method comprises a) ot.turing a cell under conditions 
suable for expression of the polypeptide, wherein said cell is transformed with a recombinant 
polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 
35 polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds , a 
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polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting fSEQ ID NO: 1-43, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
5 selected from the group consisting of SEQ ID NO: 1-43. and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. 

The invention further provides an isolated polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
10 sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 
contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
15 sample, said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO.44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
20 to b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a 
probe comprising at least 20 contiguous nucleotides comprising a sequence complementary to said 
target polynucleotide in the sample, and which probe specifically hybridizes to said target 
polynucleotide, under conditions whereby a hybridization complex is formed between said probe and 
said target polynucleotide or fragments thereof, and b) detecting the presence or absence of said 
25 hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
30 consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said target 
polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) detecting 
35 the presence or absence of said amplified target polynucleotide or fragment thereof, and, optionally, if 
present, the amount thereof. 
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The invention further pr vides a pharmaceutical composition comprising an effective amount 
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 -43, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: M3, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-43, and a pharmaceutically 
acceptable excipient. In one embodiment, the pharmaceutical composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-43. The invention additionally provides 
10 a method of treating a disease or condition associated with decreased expression of functional TPPT, 
comprising administering to a patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
15 occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO:M3, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
20 agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 

composition comprising an agonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TPPT, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 
25 Additionally, the invention provides a method for screening a compound for effectiveness as 

an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -43, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
30 acid sequence selected from the group consisting of SEQ ID NO: 1 -43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an antagonist compound identified by the method and a pharmaceutically 
35 acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with overexpression of functional TPPT, comprising administering to a patient in 
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need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino 
5 acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43. and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. The method comprises a) 
combining the polypeptide with at least one test compound under suitable conditions, and b) 
10 detecting binding of the polypeptide to the test compound, thereby identifying a compound that 
specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
15 occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
20 permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 

presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein ; 
change in the activity of the polypeptide in the presence of the test compound is indicative of i 
compound that modulates the activity of the polypeptide. 
25 The invention further provides a method for screening a compound for effectiveness in 

altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:44-86, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 
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BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding TPPT. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of TPPT. 
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Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding TPPT were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 



DESCRIPTION OF THE INVENTION 
Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an." 
and 'the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
20 reference to one or more antibodies and equivalents thereof known to those skilled in the an, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 

25 used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

30 DEFINITIONS 

"TPPT" refers to the amino acid sequences of substantially purified TPPT obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
35 TPPT. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TPPT either by directly, interacting with 
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TPPT or by acting on components of the biological pathway in which TPPT participates. 

An "allelic variant" is an alternative form of the gene encoding TPPT. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
5 many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding TPPT include those sequences with deletions, 
10 insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TPPT or a 
polypeptide with at least one functional characteristic of TPPT. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding TPPT, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
15 TPPT. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TPPT. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobic^, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity ofTPPT is retained. For example, 
20 negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine. and valine; glycine and alanine; and phenylalanine and tyrosine. 
25 The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 

polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 
30 "Amplification" relates to the production of additional copies of a nucleic acid sequence. 

Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
ofTPPT. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
35 molecules, or any other compound or composition which modulates the activity of TPPT either by 
directly interacting with TPPT or by acting on components of the biological pathway in which TPPT 
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participates. 

The term "antibody" refers t intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab% and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind TPPT polypeptides can be prepared using intact polypeptides or using fragments 
5 containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 
10 The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
15 used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
20 sugar groups such as Z^-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
25 translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TPPT, or of any oligopeptide thereof, 
30 to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, S'-AGTO' pairs with its complement, 
3-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
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am.no acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Comp sitions comprising polynucleotide sequences encoding TPPT or fragments of TPPT may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., Nad), detergents (e.g., sodium dodecyl sulfate- 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems 
Foster City CA) in the 5" and/or the 3' direction, and resequence^ or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(Un.verstty of Washington. Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e. the structure and especially the function of the 
protem ,s conserved and not significantly changed by such substitutions. The table below shows 
ammo acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 

-Original Residue Conservative Substitution 



40 



Ala 
Arg 
Asn 
Asp 
Cys 
Gin 
Glu 
Gly 
His 
He 
Leu 
Lys 
Met 
Phe 
Ser 
Thr 
Trp 
Tyr 
Val 



Gly, Ser 
His, Lys 
Asp, Gin, His 
Asn, Glu 
Ala, Ser 
Asn, Glu, His 
Asp, Gin, His 
Ala 

Asn, Arg, Gin, Glu 
Leu, Val 
He, Val 
Arg, Gin, Glu 
Leu, He 

His, Met, Leu, Tip, Tyr 

Cys, Thr 

Ser, Val 

Phe, Tyr 

His, Phe, Trp 

He. Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation 
(b) the charge or hydrophobic^ of the molecule at the site of the substitution, and/or (c) the bulk of 

the side chain. 
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A "deletion" refers to a change in the amino acid or nucle tide sequence that results in the 
absence f one or more amin acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
hydrogen by an alkyl. acyl, hydroxy!, or amino group. A derivative polynucleotide encodes a 
polypeptide which retains at least one biological or immunological function of the natural molecule. 
A derivative polypeptide is one modified by glycosylate, pegylation, or any similar process that 
retains at least one biological or immunological function of the polypeptide from which it was 



derived 

10 



A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of TPPT or the polynucleotide encoding TPPT which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the ennre length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
15 fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. Afragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15. 
16,20,25,30,40,50,60,75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
20 from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:44-86 comprises a region of unique polynucleotide sequence that 
25 specifically identifies SEQ ID NO.44-86, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:44-86 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:44-86 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:44-86 and the region of SEQ ID NO:44-86 to which the fragment corresponds are routinely 
30 determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:l-43 is encoded by a fragment of SEQ ID NO:44-86. A fragment 
of SEQ ID NO: 1-43 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO.1-43. For example, a fragment of SEQ ID NO:l-43 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO:l-43. The precise length of 
35 a fragment of SEQ ID NO:l-43 and the region of SEQ ID NO.1-43 to which the fragment 

corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
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purpose for the fragment. 

A "full-lengm" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a transition termination codon. A 
full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence 
5 "Homology" re f ers to sequence similarity or. interchangeably, sequence identity, bet^n two 

or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and -» identity." as applied to polynucleotide sequences refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insen, in a standardized and reproducible way, gaps 
•n the sequences being compared in order to optimize alignment between two sequences, and 
therefore actoeve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3 1 2e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
•5 molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Htggtns. D.G. and P.M. Sharp (1989) CABIOS 5:15.-153 and in Higgins. D.G. et al. (1992) CABIOS 
8: 89-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuole=2, gap penalty=5, wi„dow=4, and "diagonals saved"=4. The "weighted" residue 
we lg ht table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
20 similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
» prowded by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul. S.F. et al. (.990) J. Mol. Biol. 215:403^,0), which is available from 
several sources, including the NCBI, Bethesda, MD. and on the Internet at 
25 http://www.ncbi.nlmnih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "b.asm," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences "BLAST 2 
S^uences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bI2.html. The 
BLAST 2 Sequences" tool can be used for bom blastn and blastp (discussed below). BLAST 
programs are common.y used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (ApriI-2,-2000) set at defau.t parameters. Such default parameters may be. for example: 
Matrix: BLOSUM62 
35 Reward for match: I 

Penalty for mismatch: -2 
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Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at leas, 30, at leas, 40, a, leas, 50, at least 70, at leas, 1 00, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and i, is understood ,ha, any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences ,ha, do no, show a high degree of identity may nevertheless encode 
s.nular ammo acid sequences due to the degeneracy of the genetic code. I, is understood that changes 
in a nucle.c acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and «% identity," as applied ,o polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
ahgnmem methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobic^ at the 
site of substation, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktup.e=l gap 
penalty=3, window=5. and "diagonals saved"=5. The PAM250 matrix is selected as the default 
resadue weigh, table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
companson of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-21-2000) with blastp set at default parameters. Such default parameters may be. for 
example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
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Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
5 for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, overthe lengti, of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at leas, 15. at least 20. at leas, 30. a, least 40, at least 50. at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and i, is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
10 used ,o describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
15 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity 
10 Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" sterfs). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
bmding. i.e.. binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68»C in the presence of about 6 x SSC. about 
1% (w/v) SDS, and about 100 ug/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
30 under which the wash step is carried out. Such wash temperatures are typically selected to be about 
STC to 20-C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T. and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook. J. et al.. 
35 1989 - Molecular Honing: A Laboratory M.n.nl . 2" ed.. vol. 1-3. Cold Spring Harbor Press. 
Plainview NY; specifically see volume 2. chapter 9. 
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High stringency conditions for hybridization between polynucleotides of the present invention 
.nclude wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS. for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55'C. or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
5 reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
10 conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., Qt or R,, analysis) or formed between one 
15 nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
20 "Immune response" can refer to conditions associated with inflammation, trauma, immune 

disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TPPT which is 
25 capable of eliciting an immune response when introduced into a living organism, for example, a 

mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of TPPT which is useful in any of the antibody production methods disclosed herein or known in the 



art. 



The term "rnicroarray" refers to an arrangement of a plurality of polynucleotides, 
30 polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a rnicroarray. 

The term "modulate" refers to a change in the activity of TPPT. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
35 biological, functional, or immunological properties of TPPT. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
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polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
anttsense strand, to peptide nucleic acid (PNA). or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
5 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and. where 
necessary to join two protein coding regions, in the same reading frame, 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
10 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
ammo ac.d residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, 
and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TPPT may involve lipidation, glycosylation 
15 phosphorylation, acety.ation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TPPT. 

"Probe" refers to nucleic acid sequences encoding TPPT, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
20 .solated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typ.cal labels include radioactive isotopes, ligands. chemiluminescent agents, and enzymes. 
"Pnmers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
25 .dentification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40. 50, 60, 70, 80, 90, 100. 
oratleast 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
30 may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. e. al., 1989, Molecular pin*,,, a Laboratnrv Man „ a , ^ ^ yo| ^ Co]d 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Current PmtocnK in M,^ 
35 Bute Greene Publ. Assoc. & Wiley-Imersciences, New York NY; Innis, M. et al., 1990 PCR 
Protocols, A Guide to Methods and Applied- , Academic Press, San Diego CA. PCR primer pairs 
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can be derived from a kn wn sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5. 1991, Whitehead Institute for Biomedical Research. Cambridge 
MA). B 

Oligonucleotides for use as primers are selected using software known in the art for such 
5 purpose. For example. OUGO 4.06 software is useful for the selection of PGR primer pairs of up ,o 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
se.ect.on programs have incorporated additional features for expanded capabilities. For example, the 
PnmOU pnmer selection program (available to the public from the Genome Center at University of 
.0 Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
select™ program (available to the public from the Whitehead Institute/Mrr Center for Genome 
Research. Cambridge MA) allows the user to input a "mispriming library," in which sequences to 
avo.d as pnmer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
«5 ohgonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PnmeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambndge UK) designs primers based on multiple sequence alignments, thereby allowing 
selecuon of primers that hybridize to either the most conserved or least conserved regions of aligned 
20 nucle.c acid sequences. Hence, this program is useful for identification of both unique and conserved 
ohgonucleotides and polynucleotide fragment, The oligonucleotides and polynucleotide fragments 
■dennfied by any of the above selection methods are useful in hybridization technologies, for example 
as PCR or sequencng primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection 
25 are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that ,s made by an artificial combination of two ormore otherwise separated segments of sequence 
Th.s amficial combination is often accomplished by chemical synthesis or, more commonly, by the 
aruficial manipulation of isolated segments of nucleic acids, e.g.. by genetic engineering techniques 
30 such as those described in Sambrook, sjmja. The term recombinant includes nucleic acids that have 
been altered so.ely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently a 
recombinant nucleic acid may include a nucleic acid sequence operab.y linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

35 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
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expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regtons of a gene and includes enhancers, promoters, introns, and 5" and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
5 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid 
anuno ac.d. or antibody. Reporter molecules include radionuclides; enzymes; fluorescent 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

10 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 

instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
15 acids encoding TPPT. or fragments thereof, or TPPT itself, may comprise a bodily fluid; an extract 
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA RNA or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protem or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
20 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 
25 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% free 
preferably at least 75% free, and most preferably at least 90% free from otlier components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
30 by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters 
chips, sl.des, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, p.ns, channels and pores, to which polynucleotides or polypeptides are bound. 
35 A "transcript image" refers to the collective pattern of gene expression by a particular cell 

type or tissue under given conditions at a given time. 



26 



10 



WO 00/78953 PCT/USOO/16668 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art. 
The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
15 recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
20 transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%. at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as. for example, 
an "allelic" (as defined above), "splice." "species," or "polymorphic" variant. A splice variant may 
30 have significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
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polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of. for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
5 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1 999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%. at least 70%, at least 80%. at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 
10 THE INVENTION 

The invention is based on the discovery of new human transport proteins (TPPT). the 
polynucleotides encoding TPPT, and the use of these compositions for the diagnosis, treatment, or 
prevention of transport, metabolic, neurological, cardiovascular, reproductive, and immune disorders, 
and cell proliferative disorders including cancer. 
>5 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

TPPT. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
nucleic acids encoding each TPPT were identified, and column 4 shows the cDNA libraries from 
which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA 
libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA 
libraries. In some cases, GenBank sequence identifiers are also shown in column 5. The Incyte 
clones and GenBank cDNA sequences, where indicated, in column 5 were used to assemble the 
consensus nucleotide sequence of each TPPT and are useful as fragments in hybridization 
technologies. 

25 The columns of Table 2 show various properties of each of the polypeptides of the invention: 

column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical 

30 methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding TPPT. The first column of Table 3 lists the nucleotide 
35 SEQIDNOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments 
are useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:44-86 
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and to distinguish between SEQ ID NO:44-86 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express TPPT as a fraction of total tissues expressing TPPT. Column 4 
lists diseases, disorders, or conditions associated with those tissues expressing TPPT as a fraction of 
5 total tissues expressing TPPT. Column 5 lists the vectors used to subclone each cDNA library. 

Of particular interest is the expression of SEQ ID NO:50 exclusively in cardiovascular tissue, 
the expression of SEQ ID NO:56 in nervous and gastrointestinal tissues, the expression of SEQ ID 
NO:57 in gastrointestinal tissues, and the expression of SEQ ID NO:66 in nervous system tissues. Of 
particular note is the tissue-specific expression of SEQ ID NO:75. Over 71 % of the cDNA libraries 
10 expressing SEQ ID NO:75 are derived from lung tissue. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding TPPT were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 
15 SEQ ID NO:44 maps to chromosome 7 within the interval from 38.80 to 42.10 centiMorgans. 

SEQ ID NO.-48 maps to chromosome X within the interval from 107.90 to 122.80 centiMorgans. 
SEQ ID NO:60 maps to chromosome 2 within the interval from 157.0 to 167.0 centiMorgans. SEQ 
ID NO:65 maps to chromosome 2 within the interval from 17.4 to 40.7 centiMorgans and to 
chromosome 5 within the interval from 61.1 to 69.6 centiMorgans. The interval on chromosome 5 
20 from 61.1 to 69.6 centiMorgans also contains genes associated with Cockayne syndrome. SEQ ID 
NO:69 maps to chromosome 3 within the interval from 157.40 to 162.00 centiMorgans. SEQ ID 
NO:70 maps to chromosome 3 within the interval from 176.40 to 179.80 centiMorgans. SEQ ID 
NO:71 maps to chromosome 18 within the interval from the p-terminus to 52.30 centiMorgans. SEQ 
ID NO:73 maps to chromosome 17 within the interval from 75.70 to 84.20 centiMorgans, and to 
25 chromosome 2 within the interval from 204.70 to 209.30 centiMorgans. SEQ ID NO:76 maps to 
chromosome 20 within the interval from 79.00 to 94.40 centiMorgans. SEQ ID NO:80 maps to 
chromosome 18 within the interval from 1 .60 to 6.20 centiMorgans, and to chromosome 11 within the 
interval from 1 17.90 to 126.00 centiMorgans. SEQ ID NO:83 maps to chromosome 17 within the 
interval from 67.60 to 69.30 centiMorgans, and from 83.8 centiMorgans to the q-terminus, and to 
30 chromosome 7 within the interval from 105.20 to 1 14.50 centiMorgans . 

The invention also encompasses TPPT variants. A preferred TPPT variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TPPT amino acid sequence, and which contains at least one functional or 
structural characteristic of TPPT. 
35 The invention also encompasses polynucleotides which encode TPPT. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
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from the group consisting of SEQ ID NO:44-86. which encodes TPPT. The po.ynuc.eotide sequences 
of SEQ ID NO:44-86, as presented in the Sequence Listing, embrace the equivalent RNA sequences 
where* occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 
5 The invention also encompasses a variant of a polynucleotide sequence encoding TPPT In 

parucular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TPPT. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID NO-44- 
10 86 wh.ch has at least about 70%, or alternatively at leas, about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of 
SEQ ID NO:44-86. Any one of the polynucleotide variants described above can encode an amino acid 
sequence wh.ch contains at least one functional or structural characteristic of TPPT. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
15 geneuc code, a multitude of polynucleotide sequences encoding TPPT, some bearing minima, 
simrlanty to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices These 
combmanons are made in accordance with the standard triplet genetic code as applied to the 
10 polynucleotide sequence of naturally occurring TPPT. and all such variations are to be considered as 
being specifically disclosed. 

Although nucleotide sequences which encode TPPT and its variants are generally capable of 
hybndtzmg to the nucleotide sequence of the naturally occurring TPPT under appropriately selected 
cond.trons of stnngency. it may be advantageous to produce nucleotide sequences encoding TPPT or 
5 ,ts denvauves possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurnng codons. Codons may be seated to increase the rate at which expression of the peptide 
occurs ,n a partrcular prokaryotic or eukaryotic host in accordance with the frequency with which 
part.cular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TPPT and its derivatives without altering the encoded amino acid sequences 
mclude the production of RNA transcripts having more desirable properties, such as a greater 
half-hfe, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TPPT and 
TPPT denvatives. or fragments thereof, entirely by synthetic chemistry. After production, the 
synthenc sequence may be mserted i nt0 any of the many available expression vectors and cell systems 
us,ng reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutat.ons mto a sequence encoding TPPT or any fragment thereof. 
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Also encompassed by the invention are polynucie tide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and. in particular, to those shown in SEQ ID 
NO:44-86 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel. A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 
Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found 
in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, 
sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer 
system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
15 CATALYST 800 thermal cycler (PE Biosystems). Sequencing is then carried out using either the 
ABI 373 or 377 DNA sequencing system (PE Biosystems), the MEGABACE 1 000 DNA sequencing 
system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., 
Ausubel, F.M. (1997) Short Protocols in MoWnhr Biolopy, John Wiley & Sons, New York NY unit 
20 7.7; Meyers, R.A. (1995) Molecular Biology and Bio^hnn.^ Wiley VCH, New York NY pp 
856-853.) 

The nucleic acid sequences encoding TPPT may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 

25 restriction-site PGR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 

30 Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-1 19.) In this method, multiple restriction enzyme digestions and 
hgafons may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 

35 are known in the art. (See. e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clomech. Palo 
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Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/ex njunctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
5 length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72"C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
10 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
15 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
20 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode TPPT may be cloned in recombinant DNA molecules that direct expression of TPPT, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same or a functionally 
25 equivalent amino acid sequence may be produced and used to express TPPT. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TPPT-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
30 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylate patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. .4:315-319) to alter or 
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improve the biological properties of TPPT. such as iu biologica] or enzymatic activity or ta ability to 
btnd ,o other molecules or compounds. DNA shuffling is a process by which a library of gene 
vanants ,s produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
5 properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through ••artificial- 
breeding and rapid molecular evolution. For example, fragment* of a single gene containing random 
pomt mutattons may be recombined, screened, and then reshuffled until the desired properties are 
opurruzed. Alternatively, fragments of a given gene may be recombined with fragments of 
10 homologous genes in the same gene family, either from the same or different species, thereby 

maxmuzmg the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding TPPT may be synthesized, in whole or in part 
usmg chemical methods well known in the art. (See, e.g.. Careers, M.H. et al. (1980) Nucleic Acids 
15 Symp. Ser. 7:215-223; and Hom, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively 
TPPT uself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See e g 
Cre.ghton, T. (1984) Proteins, Struts and Molecular Property , WH Freeman, New York NY, pp 
55-60; and Roberge. J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
20 usmg the ABI 431 A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of 
TPPT, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
25 chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421 ) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active TPPT, the nucleotide sequences encoding TPPT or 
denvatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 

30 the necessary elements for transcriptional and translation* control of the inserted coding sequence in 
a suuable host. These elements include regulatory sequences, such as enhancers, constitutive and 
mductble promoters, and 5'and 3 'untranslated regions in the vector and in polynudeotide sequences 
encodmg TPPT. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TPPT. Such signals 

35 mclude the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding TPPT and its initiation codon and upstream regulatory sequences are inserted into 
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the appropriate expression vector, no additional transcriptional or translation control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, ex genous 
translationaJ control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translation^ elements and initiation codons may be of various origins, both 
5 natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate for the particular host cell system used. (See. e.g., Scharf. D. et al. (1994) Results Probl 
Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construe, expression 
vectors containing sequences encoding TPPT and appropriate transcriptional and translational control 
10 elements. These methods include mvitro recombinant DNA techniques, synthetic techniques, and in 
mo genetic recombination. (See, e.g., Sambrook, J. e. al. (1989) Molecu.arC.nnin,, a , 
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel. F.M. et al. (1995) 
Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
.5 encoding TPPT. These include, but are not limited to. microorganisms such as bacteria transformed 
wtth recombinant bacteriophage, plasmid. or cosmid DNA expression vectors; yeast transformed with 
yeas, expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus)- 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or P BR322 plasmids); or 
an,mal cell systems. (See, e.g., Sambrook, suem; Ausubel, supm; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544- 
Scoter, C.A. et al. (1994) Biotechnology 12:181-184; Engelhard, E.K. e, al. (1994) Proc. Natl. Acad 
Sc. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu N 
(1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Scence 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hill 
Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan J 
and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, JJ. et al. (1997) Nat. 
Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or 
vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences 
30 to the targeted organ, tissue, or cell population. (See, e.g.. Di Nicola, M. et al. (1998) Cancer Gen. 
Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90 ( 13):6340-6344; Buller, R M 
et al. (1985) Nature 317(6040):813-815; McGregor. D.P. et al. (1994) Mol. Immunol. 3 1(3):2 19-226- 
and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is no, limited by the host 
cell employed. 

35 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding TPPT. For example, routine cloning, 
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subcloning, and propagation of polynucleotide sequences encoding TPPT can be achieved using a 
multifunctional Ecoli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding TPPT into the vector's multiple cloning 
site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
5 transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
invitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g.. Van Heeke, G. and S.M. Schuster (1989) J. Biol 
Chen, 264:5503-5509.) When large quantities of TPPT are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of TPPT may be used. For example, vectors 
10 containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TPPT. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvce. r^vi.,-- or Pjchia^ajtorjs. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
15 integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra; Bitter, supra : and Scorer, supra .) 

Plant systems may also be used for expression of TPPT. Transcription of sequences encoding 
TPPT may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu. N. (1987) EMBO J. 6:307-31 1). 
20 Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, sues; Broglie, supra; and Winter, supra.) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
C g - McOraw Hill Yearbook of Science and Technnlo rY (1992) McGraw Hill, New York NY 
pp. 191-196.) 

25 In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, sequences encoding TPPT may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TPPT in host cells. (See, e.g.. Logan, J. and T. Shenk (1984) Proc 
30 Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
35 constructed and delivered via conventional delivery methods (liposomes, polycationic amino 

polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
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15:345-355.) 

For long term product! n of recombinant proteins in mammalian systems, stable expression of 
TPPT in cell lines is preferred. F r example, sequences encoding TPPT can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
5 expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media 
before being switched to selective media. The purpose of the selectable marker is to confer resistance 
to a selective agent, and its presence allows growth and recovery of cells which successfully express 
the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
10 culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in * and apr cells, respectively. (See, e.g., Wigler, M. et al. 
(1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or 
15 herbicide resistance can be used as the basis for selection. For example, <tt/r confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase. respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mot. Biol. 150: 1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
20 alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), B glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
25 (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TPPT is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TPPT can be identified by the absence of marker gene function. Alternatively, a 
30 marker gene can be placed in tandem with a sequence encoding TPPT under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TPPT and that express 
TPPT may be identified by a variety of procedures known to those of skill in the art. These 
35 procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations. PCR 

amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
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chip based technologies for the detection and/or quantification of nucleic acid or protein sequences 

Immunological methods for detecting and measuring the expression of TPPT using either 
specie polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
5 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TPPT is preferred, but a 
compete binding assay may be employed. These and other assays are well known in the an. (See 
e.g., Hampton, R. et al. (1990) Serological Methods a Lahnn,,,™ Ma „,... ^ St Pau , ^ 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immnnolo^, Greene Pub. Associates and 

10 W.ley-Interscience, New York NY; and Pound, J.D. (1998) Immunochen^l P ,. Humana 

Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TPPT 
15 .nclude oligo.abe.ing, nick translation, end-labeling, or PCR amplification using a labeled nucleotide 
Alternately, the sequences encoding TPPT, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the an, are commercially available, 
and may be used to synthesize RNA probes invito by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
20 of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TPPT may be cultured under 
25 condn.ons suitable for the expression and recovery of the protein from cell culture. The protein 

produced by a transformed cel. may be secreted or retained intracellular* depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
contatning polynucleotides which encode TPPT may be designed to contain signal sequences which 
dtrect secretion of TPPT through a prokaryotic or eukaryotic cell membrane. 
30 In addition, a host cell strain may be chosen.for its ability to modulate expression of the 

■nserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypep„de include, but are no. limited to, acetylation, carboxylation, glycosylate 
phosphorylation, Hpidation, and acylation. Post-translational processing which cleaves a "prepro" or 
pro" form of the protein may also be used to specify protein targeting, folding, and/or activity 
35 Afferent host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO. HeLa, MDCK, HEK293. and WI38) are available from the 
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American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the f reign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TPPT may be ligated to a heterologous sequence resulting in translation of a 
5 fusion protein in any ofthe aforementioned host systems. For example, a chimeric TPPT protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TPPT activity. Heterologous protein and 
pept.de moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST) 
10 maltose binding protein (MBP). thioredoxin (Trx), calmodulin binding peptide (CBP). 6-His FLAG 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate 
fuston proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal- 
chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
punfication of fusion proteins using commercially available monoclonal and polyclonal antibodies 
15 that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the TPPT encoding sequence and the heterologous protein 
sequence, so that TPPT may be cleaved away from the heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra ch 10) 
A variety of commercially available kits may also be used to facilitate expression and purification of 
20 fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TPPT may be achieved in 
mo using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
25 precursor, for example, 35 S-methionine. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
specifically bind to TPPT. At least one and up to a plurality of test compounds may be screened for 
spec.fk binding to TPPT. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

30 In one embodiment, the compound thus identified is closely related to the natural ligand of 

TPPT, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 

natural binding partner. (See, Coligan, J£. et al. (1991) Current Pmro™u in i 1(2): 

Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TPPT 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 

35 compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TPPT, either as a secreted 
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protein or on the cell membrane. Preferred cells include cells from mammals, yeas,. Drosonhi., or E 
coh. CellsexpressingTPPTorcellmembiBnefracti ns which contain TPPT are then contacted with 
a test compound and binding, stimulation, or inhibition of activity of either TPPT or the compound is 

analyzed. 

5 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TPPT, either in 
solution or affixed to a solid support, and detecting the binding of TPPT to the compound. 
Alternatively, the assay may detect or measure binding of a tes, compound in the presence of a 
10 labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
hbranes, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of TPPT. Such compounds may include agonists, antagonists, or partial or 
15 mverse agonists. In one embodiment, an assay is performed under conditions permissive for TPPT 
activity, wherein TPPT is combined with a, least one test compound, and the activity of TPPT in the 
presence of a test compound is compared with the activity of TPPT in the absence of the test 
compound. A change in the activity of TPPT in the presence of the test compound is indicative of a 
compound that modulates the activity of TPPT. Alternatively, a test compound is combined with an 
20 mvjtro or cell-free system comprising TPPT under conditions suitable for TPPT activity, and the 
assay is performed. In either of these assays, a test compound which modulates the activity of TPPT 
may do so indirectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TPPT or their mammalian homologs may 
25 be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767.337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
30 mterest disrupted by a marker gene, e.g.. the neomycin phosphotransferase gene (neo; Capecchi M R 
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
35 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BI76 mouse strain. The Wastocysts are surgically txansferred 
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to pseudopregnam dams, and the resu , ting chimeric progeny „ genotype(j ^ ^ ^ 

heterozygous or homozygous strains. Transgenic animals thus generated may b tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding TPPT may also be manipulated hrvkro in ES cells derived from 
5 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
■mo, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J A. et al 
(1998) Science 282:1 145-1 147). 

Polynucleotides encoding TPPT can also be used to create "knockin" humanized animals 
10 (P'g^ortransgenicanimals^iceor^tomodelhumandisease. With knockin technology a 
regron of a po.ynucleotide encoding TPPT is injected into animal ES cells, and the injected sequence 
integrates mto the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease 
15 Alternately, a mammal inbred to overexpress TPPT, e.g., by secreting TPPT in its milk, may also 
serve as a convenient source of mat protein (Janne. J. et al. (1998) Biotechnol. Annu. Rev 4 55-74) 
THERAPEUTICS 

Chemical and structural similarity, e.g.. in the context of sequences and motifs, exists 
between regions of TPPT and human transport proteins. In addition, the expression of TPPT is 
20 closely associated with neurological, cardiovascular, reproductive, gastrointestinal, and 

hematopoietic/immune tissues, and inflammation, cell proliferation, and cancer. Therefore TPPT 
appears to play a role in transport, metabolic, neurological, cardiovascular, reproductive, and immune 
d.sorders, and cell proliferative disorders including cancer. In the treatment of disorders associated 
w,«h mcreased TPPT expression or activity, it is desirable to decrease the expression or activity of 
25 TPPT. In the treatment of disorders associated with decreased TPPT expression or activity, it is 
desirable to increase the expression or activity of TPPT. 

Therefore, in one embodiment. TPPT or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of TPPT 
Examples of such disorders include, bu, are not limited to. a transport disorder such as akinesia 
30 amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis. Becker's muscular dystrophy 
Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus. diabetes insipidus, diabetic neuropathy 
Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic periodic paralysis 
Parkmson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, myotonic 
dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral neoplasms. 
35 prostate cancer; cardiac disorders associated with transport, e.g.. angina, bradyarrythmia, 
tachyanythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
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myopathy, cemronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy 
ethanol myopathy, dermatomyositis, inclusion b dy myositis, infectious myositis, polymyositis; 
neurological disorders associated with transport, e.g.. Alzheimer's disease, amnesia, bipolar disorder 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia; and other 
5 disorders associated with transport, e.g.. neurofibromatosis, postherpetic neuralgia, trigeminal 
neuropathy, sarcoidosis, sickle ce.l anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, adreno.eukodystrophy, Zellweger syndrome. Menkes disease, occipital horn 
10 syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
metabohc disorder such as Addison's disease, cerebrotendinous xanthomatosis, congenital adrenal 
hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocirrhosis. 
fructose-l.^iphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism 
15 hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperl.pidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases 
mannosidosis. neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitarnin D- 
deficency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease. Pick's disease, Huntington's disease, dementia, Parkinson's 
20 disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuni, Creutzfeldt-Jakob disease, and 
25 Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders 
30 peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia. amnesia, 
catatoma. diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia. Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
35 frontotemporal dementia; a cardiovascu.ar disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins 
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thrombophlebitis and phlebothrombosis. vascular tumors, and complications of thrombolysis, balloon 
ang.op.asry, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure 
.schermc heart disease, angina pectoris, myocardial infarction, hypertensive heart disease 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve 
5 nutral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease 
tnfecuve endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
d.sease, congenital heart disease, and comp.ications of cardiac transplantation, congenital lung 
anomal.es, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
10 hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary 
d.sease, restrictive pulmonary disease, chronic obstructive pu.monary disease, emphysema, chronic 
bronchitis, bronchia, asthma, bronchiectasis, bacteria, pneumonia, viral and mycop.asma. pneumonia, 
lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis 
.d.opa«h,c pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis 
15 pulmonary eosinophil bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary 

hemorrhage syndromes. Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary 
•nvol vement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors 
mflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced 
lung chsease, radiation-induced lung disease, and comp.ications of lung transplantation- a 
20 reproductive disorder such as a disorder of pro.actin production, infertility, including tuba, disease 
ovular defects, and endometriosis, a disruption of the estrous cycle, a disruption of the menstrua, 
cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an endometrial or ovarian 
tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea; a disruption of spermatogenesis, abnormal sperm 
phys.ology, cancer of the testis, cancer of the prostate, benign prostatic hyperopia, prostatitis. 
Peyron.e's d.sease, impotence, carcinoma of the male breast, and gynecomastia; an immune disorder 
such as .nflammauon, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's 
d.sease, adul, respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
artenosclerosis. asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis 
30 au.o.mmune po.yendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis 
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyosi.is, 
dtabetes me.htus, emphysema, episodic lymphopenia with lymphocytoses, erythroblastosis fetalis 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' 
d.sease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis. hype re osinophilia 
5 .ratable bowe. syndrome, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia ' 
gravs. myocardial or pericardia, inflammation, myelofibrosis, osteoarthritis, osteoporosis 
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pancreatitis, polycythemia vera, polymyositis, psoriasis, Reker's syndrome, rheumatoid arthritis, 
scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
5 fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 

lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis. 

arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 

(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 

thrombocymemia. and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
10 myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone 

bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver. 

lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 

thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TPPT or a fragment or derivative 
15 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TPPT including, but not limited to. those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
TPPT in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of TPPT including, but not 
20 limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of TPPT may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TPPT including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TPPT may be administered to a subject to treat or 
25 prevent a disorder associated with increased expression or activity of TPPT. Examples of such 
disorders include, but are not limited to, those transport, metabolic, neurological, cardiovascular, 
reproductive, and immune disorders, and cell proliferative disorders including cancer described 
above. In one aspect, an antibody which specifically binds TPPT may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
30 cells or tissues which express TPPT. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding TPPT may be administered to a subject to treat or prevent a disorder associated with 
mcreased expression or activity of TPPT including, bu, not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
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by one of ordinary skill in the an. according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 
5 An antagonist of TPPT may be produced using methods which are generally known in the art. 

In particular, purified TPPT may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TPPT. Antibodies to TPPT may also 
be generated using methods that are well known in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
10 fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TPPT or with any fragment or oligopeptide thereof 
wh,ch has immunogenic properties. Depending on the host species, various adjuvants may be used to 
15 mcrease immunological response. Such adjuvants include, but are not limited to, Freund's. mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyamons, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 

(bacilli Calmette-Guerin) and Corvnehacteri,,™ r are especialI y preferabIe . 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to TPPT 
20 have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of 
at least about 10 amino acids. It is also preferable mat these oligopeptides, peptides, or fragments are 
idenncal to a portion of the amino acid sequence of the natural protein. Short stretches of TPPT 
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric 
molecule may be produced. 

25 Monoclonal antibodies to TPPT may be prepared using any technique which provides for the 

production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote. R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 

30 Cole. SP. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies." such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See. e.g.. Morrison. S.L. et al. (1984) Proc 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604^08; and Takeda 

35 S. et al. (1985) Nature 314:452^54.) Altemative.y, techniques described for the production of single 
cha,n anubodies may be adapted, using methods known in the art, to produce TPPT-specific single 
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chain antibodies. Antibodies with re lated specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See. e g 
Burton. D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing mviyo production in the lymphocyte 
5 populate or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
dtsc.osed in the literature. (See. e.g., Orlandi. R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TPPT may also be generated 
For example, such fragments include, but are not limited to. F^ fragments produced by pepsin 
10 dtgest.on of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab)2 fragments. Alternatively. Fab expression libraries may be constructed to allow rapid and 
easy tdentification of monoclonal Fab fragments with the desired specificity. (See, e.g , Huse W D 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
15 spectficty. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
•mmunoassays typically involve the measurement of complex formation between TPPT and its 
specrfic antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reacve to two non-interfering TPPT epitopes is generally used, but a competitive binding assay may 
20 also be employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techmques may be used to assess the affinity of antibodies for TPPT. Affinity is expressed as an 
assocatron constant, K,, which is defined as the molar concentration of TPPT-antibody complex 
d.v.ded by the molar concentrations of free antigen and free antibody under equilibrium conditions 
The K, determmed for a preparation of polyclonal antibodies, which are heterogeneous in their 
affimties for multiple TPPT epitopes, represents the average affinity, or avidity, of the antibodies for 
TPPT. The K, determined for a preparation of monoclonal antibodies, which are monospecific for a 
parttcular TPPT epitope, represents a true measure of affinity. High-affinity antibody preparations 
with K, ranging from about 10' to 10" L/mole are preferred for use in immunoassays in which the 
30 TPPT-anfbody complex must withstand rigorous manipulations. Low-affinity antibody preparations 
w.th K, ranging from about 10* to 10> L/mole are preferred for use in immunopurification and similar 
procedures which ultimately require dissociation of TPPT. preferably in active form, from the 
antibody (Catty. D. (1988) Antibodies. Vnh^ i- a o^,. ^ ir „ ^ ^ ^ 
L.ddell, J.E. and A. Cryer (1991) A Practica. n,,^ m m ^., , nfM Joh| Wi , 
35 New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
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detennine the quality and suitability of such preparations for certain downstream applications For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 
5-10 mg spec.fic antibody/ml, is generally employed in procedures requiring precipitation of TPPT- 
anubody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines 
5 for anubody quality and usage in various applications, are generally available. (See, e.g., Catty, 
supra, and Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding TPPT, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expresston can be achieved by designing complementary sequences or antisense molecules (DNA 
«0 RNA. PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
TPPT. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be destgned from various locations along the coding or control regions of sequences encoding 
TPPT. (See, e.g., Agrawal, S., ed. (1996) Antisense Th^,^ H umana Press Inc., Totawa NJ.) 
In therapeutic use, any gene delivery system suitable for introduction of the antisense 
15 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

.ntracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at leas, a portion of the cellular sequence encoding the target protein. (See. e.g 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon. KJ. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
20 vectors, such as retrovirus and adeno-associated virus vectors. (See. e.g.. Miller. A.D. (1990) Blood 
76:271 ; Ausubel. sups; Uckert. W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi. JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado R J et 
al. (1998) J. Pharm. Sci. 87(10:1308-1315: and Morris. M.C. et al. (1997) Nucleic Acids Res 
25 25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TPPT may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., m the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X- 
hnked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672). severe combined 

30 tnununodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475) 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. e. al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familia. 
hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor DC deficiencies (Crystal 

35 R.G. (1995) Science 270:40*410: Verma, I.M. and Somia, N. (1997) Nature 389:239-242)) (ii) 

express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
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edl proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399) 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida^bicaj^ and Pantcoccidio.H., 
> kMlhensk; and protozoan parasites such as Plasmodium frw^ anH Trvoanosn™ fo ^ 

case where a genetic deficiency in TPPT expression or regulation causes disease, the expression of 
TOT from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TPPT are treated by constructing mammalian expression vectors encoding TPPT and introducing 
these vectors by mechanical means into TPPT-deficient cells. Mechanical transfer technologies for 
use wuh cells inviyo or exWtro include (i) direct DNA microinjection into individual cells, (ii) 
balhsttc gold particle delivery. (Hi) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan. R.A. and W.F. Anderson (1993) Annu Rev 
B,ochem. 62:191-217; Ivies. Z. (1997) Cell 91:50.-510; Boulay. J-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TPPT include, but are not 
limited to. the PCDNA 3.1. EPITAG. PRCCMV2. PREP, PVAX vectors (Invitrogen, Carlsbad CA) 
PCMV-SCRIPT. PCMV-TAG. PEGSH/PERV (Stratagene. La Jo.la CA), and PTET-OFF. PTET-ON 
PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TPPT may be expressed using (i) a 
const.tut.vely active promoter, (e.g.. from cytomegalovirus (CMV). Rous sarcoma virus (RSV) SV40 
virus, thymidine kinase (TK). or P -actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:45M56), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-mducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mife P ristone inducible promoter (Rossi, F.M V 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TPPT from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by e.ectroporation (Neumann, E. et al 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfeciion protocols. 
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In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TPPT expression are treated by constructing a retrovirus vector consisting of (i) the 
p lynucle tide encoding TPPT under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Nad. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, MA et al. (1987) J. Virol. 61 :1639-1646; Adam, M.A. and 
AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
15 discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 

reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), 
and the return of transduced cells to a patient are procedures well known to persons skilled in the art 
of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71 :7020-7029; 
Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi. M.L. (1997) J. Virol. 71:4707^716; Ranga, 
5 U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TPPT to cells which have one or more genetic abnormalities with respect to 
the expression of TPPT. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 
i versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Aminozzi, P.A. et al. (1999) Annu. 
Rev. Nutr. 19:51 1-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TPPT to target cells which have one or more genetic abnormalities with 
respect to the expression of TPPT. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TPPT to cells of the central nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
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been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res.l69:385-395). The construction of a HSV-1 virus vector has also been disci sed in detail in U S 
Patent Number 5.804,413 to DeLuca ("Herpes simplex vims strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
5 HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Coins. W.F. et al. (1999) J. Virol. 73:519-532 and Xu. H et al 
(1994) Dev. Biol. 163:152-161. hereby incorporated by reference. The manipulation of cloned 
10 herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the an. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
15 deliver polynucleotides encoding TPPT to target cells. The biology of the prototypic alphavirus. 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K,J. Li (1998) Curr. Opin. Biotech. 9:464-169). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, 
20 resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TPPT into the alphavirus 
genome in place of the capsid-coding region results in the production of a large number of TPPT- 
coding RNAs and the synthesis of high levels of TPPT in vector transduced cells. While alphavirus 
mfection is typically associated with cell lysis within a few days, the ability to establish a persistent 
25 infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. e, al. (1997) Virology 228:74-83). The wide host range of alphavimses will allow the 
introduction of TPPT into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
30 infectious cDNA clones of alphavimses, performing alphavirus cDNA and RNA transfections. and 
performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and + 10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition 
can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
35 causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
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been described in the literature. (See, e.g., Gee, J£. et al. (1994) in Huber. B.E. and B.I. Carr, 
Molecular and Immunologic Approaches Futura Publishing, Mt. Kisco NY. pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA 
by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TPPT. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
15 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
20 Alternatively, RNA molecules may be generated by invitro and invivo transcription of DNA 

sequences encoding TPPT. Such DNA sequences may be incorporated into a wide variety of vectors 
with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs 
that synthesize complementary RNA. constitutively or inducibly, can be introduced into cell lines, 
cells, or tissues. 

25 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to. the addition of flanking sequences at the 5'and/or 3'ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
w.thin the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TPPT. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
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chemical entities which are capable of interacting wi.h specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased TPPT 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
5 encoding TPPT may be therapeutically useful, and in the treament of disorders associated with 
decreased TPPT expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding TPPT may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
10 commonly known in the art, including chemical modification of a compound known to be effective in 
altenng polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorial* or randomly. A sample comprising a 
15 polynucleotide encoding TPPT is exposed to at least one tes, compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in_vjtr<> cell-free or reconstituted 
biochemtcal system. Alterations in the expression of a polynucleotide encoding TPPT are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 
by hybndization with a probe having a nucleotide sequence complementary to the sequence of the 
20 polynucleotide encoding TPPT. The amount of hybridization may be quantified, thus forming the 
bas,s for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
tes, compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in a .tering expression of a specific polynuc.eotide 
can be earned out, for example, using a Schizosaccharomv,,., r ^ gene expression system 
(Atkins, D. et al. (1999) U.S. Patent No. 5.932,435; Arndt. G.M. et al. (2000) Nucleic Acids Res. 
28;E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686.242; Bruice, T.W. e, al. (2000) U S Patent 
No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use invivo, i^, and ex^vo. For ex_vivo therapy, vectors may be introduced into stem cells 
35 taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
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using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 

monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active i ngre dient formulated with a pharmaceutical* 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest 
edi,i0n - Remington's Pharmaceutical SqVnres (Maack Publishing. Easton PA). Such 
pharmaceutical compositions may consist of TPPT. antibodies to TPPT. and mimetics, agonists, 
antagonists, or inhibitors of TPPT. 

The phannaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
15 intramedullary, intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 
20 delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. 
larger peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 
25 enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
30 delivery of macromolecules comprising TPPT or fragments thereof. For example, liposome 

preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
dehvery of the macromolecule. Alternatively, TPPT or a fragment thereof may be joined to a short 
cationic N-terminal portion from the fflV Tat- 1 protein. Fusion proteins thus generated have been 
found to transduce into the cells of all tissues, including the brain, in a mouse model system 
35 (Schwarze. S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
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culture assays, e.g.. of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 

administration in humans. 

5 A therapeutically effective dose refers to that amount of active ingredient, for example TPPT 

or fragments thereof, antibodies of TPPT. and agonists, antagonists or inhibitors of TPPT, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED* (the dose therapeutically effective in 50% of the population) or LD, (the dose 
10 lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the U^ED. ratio. Pharmaceutical compositions 
wh,ch exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 
animal studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED* with 
15 little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
acuve moiety or to maintain the desired effect. Factors which may be taken into account include the 
20 seventy of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half-life and clearance rate of the particular 
formulation. 

25 Normal dosage amounts may vary from about 0.1 M g to 1 00.000 ^g, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

30 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TPPT may be used for the 
diagnosis of disorders characterized by expression of TPPT. or in assays to monitor patients being 
treated with TPPT or agonists, antagonists, or inhibitors of TPPT. Antibodies useful for diagnostic 
35 purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for TPPT include methods which utilize the antibody and a label to detect TPPT in human body fluids 
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r in extracts of cells or tissues. The antibodies may be used with or without modification, and may 
be labeled by covalent or non-covalen, attachment of a reporter mo.ecule. A wide variety of reporter 
molecules, several f which are described above, are known i„ the art and may be used. 

A variety of protocols for measuring TPPT, including ELISAs, RIAs, and FACS. are known 
5 m the art and provide a basis for diagnosing altered or abnormal levels of TPPT expression. Normal 
or standard values for TPPT expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibody to TPPT under 
condmons suitable for complex formation. The amount of standard complex formation may be 
quanutated by various methods, such as photometric means. Quantities of TPPT expressed in subject 
10 control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TPPT may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
15 quanufy gene expression in biopsied tissues in which expression of TPPT may be correlated with 
dtsease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
TPPT. and to monitor regulation of TPPT levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, mcluding genomic sequences, encoding TPPT or closely related molecules may be used to 
20 .dentify nucleic acid sequences which encode TPPT. The specificity of the probe, whether it is made 
from a h.ghly specific region, e.g., the 5' regulatory region, or from a less specific region, e g a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe tdentifies only naturally occurring sequences encoding TPPT. allelic variants, or related 
sequences. 

25 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TPPT encoding sequences. The hybridization probes of the subject 
mvention may be DNA or RNA and may be derived from the sequence of SEQ JD NO : 44-86 or from 
genomic sequences including promoters, enhancers, and introns of the TPPT gene. 

Means for producing specific hybridization probes for DNAs encoding TPPT include the 
30 clonmg of polynucleotide sequences encoding TPPT or TPPT derivatives into vectors for the 

production of mRNA probes. Such vectors are known i„ the art, are commercially avai.ab.e. and may 
be used to synthesize RNA probes invjjro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
vanety of reporter groups, for example, by radionuclides such as *P or *S, or by enzymatic labels 
35 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding TPPT may be used for the diagnosis of disorders 
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associated wi,hexp ress io„onPPr. Examplesof such disorders inc.ude. bu , „ not limited , 

.nstp.dus, d.abetic neuropathy, Duchenne m uscuiar dystrophy, hyperkalemic periodic paralysis 
nor^cp^ 

myasthema gravis. m y 0 to„i c dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, pros tate cancer; cardiac disorders associated with transport e g 
angma, bradyarrythmia, tachyarrythmia, hypertension. Long QT syndrome, myocarditis ' ' 
cardiopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethano, myopathy, dermatomyositis. inclusion body myositis 
mfectious myositis, polymyositis; neurological disorders associated with ttans^ e .g.. Alzheimer's 
dtsease, amneaa, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
Psychoses, ^schizophrenia; and o^ 

postherpeuc neuralgia, trigeminal neuropathy, sarcoidosis, sickle eel, anemia. Wilson's disease 
catatacts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave'sdisease. goiter. Cushing's disease, Addison's disease, g.ucose-galacto. 
malabsorption syndrome, hypercholesterolemia, ad.noleukodystrophy, ZeHweger syndrome. Menkes 
dtsease, occ.pita, horn syndrome, von Gierke disease, cystinuria. iminoglycinuria. Hartup disease, and 

congemtal adrenal hyperp,asi, coumarin resistance, cyst* fibrosis, diabetes, fatty hepatocirrhosis. 

Setose- ..^phosphatase deficiency, galactosemia, goiter, g.ucagonoma, glycogen storage diseases, 

hereduary fructose intolerance, hyperadrenalism. hypoadrenalism. hyperparathyroidism. 

hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism. 

hyperemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases 
25 mannos.dosis, neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitamin D- 
deficency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascu,ar disease, stroke 
cerebral neoplasms, Alzheimer's disease, Pick's disease. Huntington's disease, dementia. Parkinson's 
dtsease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
d.sorders. progressive neura, muscular atrophy, retinitis pigmentosa, hereditary ataxias, mu.tiple 

empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis viral 
central nervous system disease, prion diseases including kuru. Creutzfe.dt-Jakob disease, and ' 
^tmann-Straussler-Scheinker syndrome, fatal familia. insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sderosis. cerebe.loretina. hemangiob.astomatosis. 
encephalotngem^ 

nervous system, cerebral pa.sy, neuroskeleta, disorders, autonomic nervous system disorders, crania, 
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nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
5 catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 

neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 
thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
10 angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
15 erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital rung 
anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive 
pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, 
20 emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and 
mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, 
pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, 
hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, 
diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary 
25 hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar 
proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, 
pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung 
transplantation; a reproductive disorder such as a disorder of prolactin production, infertility, 
including tubal disease, ovulatory defects, and endometriosis, a disruption of the estrous cycle, a 
disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an 
endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and 
teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a disruption of 
spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the prostate, benign 
prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and 
gynecomastia; an immune disorder such as inflammation, actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome. 
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allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis^ectodermal dystrophy (APECED), bronchitis, bursitis, cholecystitis, cirrhosis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis. diabetes mellitus. emphysema. 
5 episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis. Goodpasture's syndrome, gout. Graves' disease. Hashimoto's 
thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia. irritable bowel 
syndrome, mixed connective tissue disease (MCTD). multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis. 
10 polycythemia vera, polymyositis, psoriasis. Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia. thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 
15 lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD). myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
20 bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. The polynucleotide sequences encoding TPPT may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in 
dipstick, pin, and multiforme ELISA-like assays; and in microarrays utilizing fluids or tissues from 
25 patients to detect altered TPPT expression. Such qualitative or quantitative methods are well known 
in the art. 

In a particular aspect, the nucleotide sequences encoding TPPT may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TPPT may be labeled by standard methods and added to a fluid or tissue sample 

30 from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to a 
control sample then the presence of altered levels of nucleotide sequences encoding TPPT in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 

35 the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 
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In order to provide a basis for the diagnosis of a dis rder associated with expression of TPPT, 
a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or c H extracts taken from n rmal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding TPPT, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified polynucleotide is 
used. Standard values obtained in this manner may be compared with values obtained from samples 
from patients who are symptomatic for a disorder. Deviation from standard values is used to establish 
the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 
) or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TPPT 
may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced jnvjtro. Oligomers will preferably contain a fragment of a polynucleotide 
encoding TPPT, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
TPPT, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TPPT may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to. single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TPPT are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the 
secondary and tertiary structures of PCR products in single-stranded form, and these differences are 
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detectable using gel electrophones in non^enaturing gels. In fSCCP. the oligonucleotide primers 
are fluorescent* labeled, which a .l ws detection of the amplimers in high-throughput equipment such 
as DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico 
SNP (isSNP). are capable of identifying polymorphisms by comparing the sequence of individual 
5 overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative. SNPs may be detected and characterized by mass spectrometry using, for example, the 
high throughput MASSARRAY system (Sequenom. Inc., San Diego CA). 

10 Methods which may also be used to quantify the expression of TPPT include relabeling or 

biotinylating nucleotides, ^amplification of a control nucleic acid, and interpolating results from 
standard curves. (See. e.g.. Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa. C 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 

15 .merest is presented in various dilutions and a spectrophotometry or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a rnicroarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 

20 numbers of genes simultaneously as described in Seilhamer. J J. et al., "Comparative Gene Transcript 
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

25 momtor the activities of therapeutic agents in the treatment of disease. In particular, this information 
may be used to develop a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

30 In another embodiment, antibodies specific for TPPT. or TPPT or fragments thereof may be 

used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See e g 
Brennan, T.M. et aJ. (1995) U.S. Patent No. 5 ;474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci 

H USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WG95/251 1 16; Shalon, D. et al 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
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2155; and Heller, M.J. et al. ( 1997) U.S. Patent No. 5.605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Micmarravs- a Practical Ap pma^h M . Schena , ed . (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TPPT may be used 
5 to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
10 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs). 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, BJ. (1991) Trends Genet. 
7: 149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop 
15 genetic linkage maps, for example, which correlate the inheritance of a disease state with the 

inheritance of a particular chromosome region or restriction fragment length polymorphism (RFLP). 
(See, e.g.. Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent insitu hybridization (FISH) may be correlated with other physical and genetic 
map data. (See. e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic 
20 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding TPPT on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

Insitu hybridization of chromosomal preparations and physical mapping techniques, such as 
25 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
30 localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23. 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 
35 In another embodiment of the invention. TPPT. its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
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screening technique, IT* fragment employed in such screening may be free in solution, affixed to a 
sohd suppon, borne on a cell surface, or located intracellular^ . Th formation of binding complexes 
between TPPT and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
5 havmg smtable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthestzed on a solid substrate. The test compounds are reacted with TPPT, or fragments thereof 
and washed. Bound TPPT is then detected by methods well known in the art. Purified TPPT can also 
be coated directly onto plates for use in the aforementioned drug screening technique, Alternatively 
10 non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drag screening assays in which neutralizing 
anubodies capable of binding TPPT specifically compete with a test compound for binding TPPT In 
th.s manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with TPPT. 

■5 In additional embodiments, the nucleotide sequences which encode TPPT may be used in any 

molecular biology techniques that have yet to be developed, provided the new techniques rely on 
propen.es of nucleotide sequences that are currently known, including, but no, limited to, such 
properties as the triplet generic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

20 descnpnon, utilize the present invention to its fullest extent. The following preferred specific 

embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below in 
particular U.S. Ser. No. 60/139,923, U.S. Ser. No. 60/148,177, U.S. Ser. No. 60/149,357, and US 

25 Ser. No. 60/162,287, are hereby expressly incorporated by reference 
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EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, whiJe others were homogenized 
and lysed ,n phenol or in a suitable mixture of denaturants, such as TRTZOL (Life Technologies) a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting .ysates were centrifuged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
•sopropanol or sodium acetate and ethanol, or by other routine methods. 
35 Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

punty. I„ some cases, RNA was treated with DNase. For most libraries, ooly(A + ) RNA was isolated 
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using oligo d(T>coupIed paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN 
Chatsworth CA), or an OUGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
.solated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Amnion, Austin TX). 
5 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNEAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, su^, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
10 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropnate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were hgated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g 
15 PBLUESCRTPT plasmid (Stratagene). PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
plasmids were transformed into competent Rcoli cells including XL 1 -Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
H. Isolation of cDNA Clones 
20 Plasmids obtained as described in Example I were recovered from host cells by jnjviyo 

excision using the UNEAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC M,n,prep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid 
QIAWELL 8 Plus Plasmid. QIAWELL 8 Ultra Plasmid purification systems or the R.E.A L PREP 96 
25 plasmtd purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput forma. (Rao. V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
30 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN D fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
HI. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows 
35 Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (PE Biosystems) thermal cyder or the PTC-200 thermal cycler (MJ 
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Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 
separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling 
software; or other sequence analysis systems known in the art. Reading frames within the cDNA 
sequences were identified using standard methods (reviewed in Ausubel. 1997, supra, unit 7.7). Some 
of the cDNA sequences were selected for extension using the techniques disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
an. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 
programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS. PRINTS, DOMO, PRODOM, and PFAM to acquire 
annotation using programs based on BLAST, FASTA. and BLIMPS. The sequences were assembled 
•nto full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 
against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO. PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
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as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 
amino acid sequences were also used to identify po.ynucleotide sequence fragments from SEQ ID 
NO:44-86. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 
1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Sc ore x Perppnt Identity 

5 x minimum {length(Seq. 1), length(Seq. 2)} 

20 Tlie product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100 and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of + 5 for every base that matches in a high-scoring segment pair 

25 (HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by gaps) 
If there is more than one HSP. then the pair with the highest BLAST score is used to calculate the 
product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 

30 either by 100% identity and 70% overlap a, one end, or by 88% identity and 100% overlap a. the 

other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding TPPT occurred. Analysis involved the categorization of cDNA libraries by 
35 organ/tissue and disease. The organ/tissue categories included cardiovascular, dermato.ogic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune. musculoskeletal, nervous. 
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reproductive, and uro.ogic. The disease/condition categories included cancer, inflation, trauma, 
cell prouferation. neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
5 Table 3. 

V. Chromosomal Mapping ofTPPT Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO.44-49 and SEQ ID NO:54-86 
were compared with sequences from the Incyte LIFESEQ database and public domain databases 
using BLAST and other implementations of the Smith-Waterman algorithm. Sequences from these 
10 databases that matched SEQ ID NO:44^9 and SEQ ID NO:54-86 were assembled into clusters of 
contiguous and overlapping sequences using assembly algorithms such as Phrap (Table 5). Radiation 
hybnd and genetic mapping data available from public resources such as the Stanford Human 
Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR). and Genethon were 
used to determine if any of the clustered sequences had been previously mapped. Inclusion of a 
15 mapped sequence in a cluster resulted in the assignment of all sequences of that cluster, including its 
particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:44. SEQ ID NO:48, SEQ ID NO:60. SEQ ID 
NO:65, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71. SEQ ID NO:73. SEQ ID NO:76, SEQ ID 
NO:80,and SEQ ID NO:83 are described in The Invention as ranges, or intervals, of human 
20 chromosomes. More than one map location is reported for SEQ ID NO:65. SEQ ID NO:73. SEQ ID 
NO:80. and SEQ ID NO:83, indicating that previously mapped sequences having similarity, but not 
complete identity, to SEQ ID NO:65. SEQ ID NO:73, SEQ ID NO:80, and SEQ ID NO:83 were 
assembled into their respective clusters. The map position of an interval, in centiMorgans. is 
measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of 
25 measurement based on recombination frequencies between chromosomal markers. On average, 1 cM 
is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to 
hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by 
Genethon which provide boundaries for radiation hybrid markers whose sequences were included in 
each of the clusters. Diseases associated with the public and Incyte sequences located within the 
30 mdicated intervals are also reported in the Invention section where applicable. Human genome maps 
and other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of TPPT Encoding Polynucleotides 
35 The full length nucleic acid sequences of SEQ ID NO:44-86 were produced by extension of 

an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
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fragment. One primer was synthesized to initiate 5'extension of me known fragment, and the other 
pnmer, to initiate 3'extension of the known fragment. The initial primers were designed using 
OUGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
5 sequence at temperatures of about 68'C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art PCR 
10 ""Panned in 96-wellp^ 

nux contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH^SO 
and P-mercaptoethanoI, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pau PCI A and PCI B: Step 1 : 94°C, 3 min; Step 2: 94°C. ,5 sec; Step 3: 60*C. 1 min; Step 4- 68'C 
15 2 mm; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68'C. 5 min; Step 7: storage at 4'C. In the 
alternate, the parameters for primer pair T7 and SK + were as follows: Step 1 : 94'C. 3 min- Step 2- 
94-C, 15 sec; Step 3: 57°C. 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3. and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 ul PICOGREEN 
20 quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 ul of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan H 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 »\ to 10^1 aliquot of the reaction mixture was analyzed by 
25 electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates 
digested with CviJI cholera virus endonuclease (Molecular Biology Research. Madison WI), and ' 
somcated or sheared prior to religation into P UC 1 8 vector (Amersham Pharmacia Biotech) For 
30 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 

agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were rehgated using T4 ligase (New England Biolabs, Beverly MA) into pUC 1 8 vector (Amersham 
Pharmaca Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent Ecoli cells. Transformed cells were selected on 
35 annbiotic-comaining media, and individual colonies were picked and cultured overnight at 37»C in 
384-well plates in LB/2x carb liquid media. 
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The cells were l yse d, and DNA was amplified by PGR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the foil wing 
parameters: Step 1: 94'C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72'C, 2 min- 
Step 5: steps 2, 3. and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4'C. DNA was 
5 quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recovenes were reamplified using the same conditions as described above. Samples were diluted with 
20% dtmethysulfoxide (1 :2. v/v). and sequenced using DYENAMIC energy transfer sequencing 
pnmers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 
10 In like manner, the polynucleotide sequences of SEQ ID NO:44-86 are used to obtain 5' 

regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
Vn. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:44-86 are employed to screen cDNAs 
15 genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, ,s specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4 06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer. 250 „Ci of 
[y-*P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
20 (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech) An 
ahquot containing 10'counts per minute of the labeled probe is used in a typical membrane-based 
hybnd.zat.on analysis of human genomic DNA digested with one of the following endonucleases: Ase 
I, Bgl H, Eco RL Pst I, Xba I. or Pvu U (DuPont NEN). 
25 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell. Durham NH). Hybridization is carried out for 16 
hours at 40-C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate 
Hybndization patterns are visualized using autoradiography or an alternative imaging means and 
30 compared. 

Vm. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing. See, e.g., Baldeschweiler, supra), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
35 aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999) 
supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
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Alternatively, a procedure analogous to a dot r slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g.. 
5 Schena, M. et al. (1995) Science 270:467-170; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can 
be selected using software well known in the art such as LASERGENE software (DNASTAR). The 
10 array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
15 complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 

Tissue or Cell Sa mple Prep aration 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

20 polyCA)* RNA is purified using the oligo-(dT) cellulose method. Each poly(A)* RNA sample is 

reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/ul oligo-(dT> primer (21mer), IX firs, 
strand buffer, 0.03 units/uJ RNase inhibitor, 500 uM dATP, 500 uM dGTP, 500 pM dTTP, 40 uM 
dCTP, 40 uM dCTP-Cy3 (BDS) or dCTP-C y 5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 

25 GEMB RIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by jnvjtro. transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 

30 (CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml). 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 ul 5X SSC/0.2% SDS. 
MicroarraY Preparation 

35 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
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uses primers complementary ,o the vector sequences flanking the cDNA insert. Amy eiements are 
amplified in thirty cyc.es of PGR from an initial quantity of 1-2 ng to a fi na , quantity ^ ^ 5 
ug. Amphfied array elements are men purified using SEPHACRYL-400 (Amersham Pharmacia 

Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
shdes (Coming) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Sc.ent.fic Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated w.th 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 



10 oven 



Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522. incorporated herein by reference. 1 ul of the array element DNA, at an average 
concentration of ,00 ng/ul. is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide 
15 Microarrays are UV-crossIinked using a STRATALINKER UV-crosslinker (Stratagene) 

Mtcroarrays are washed a, room temperature once in 0.2% SDS and three times in distilled water 
Non-specfic binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix. Inc., Bedford MA) for 30 minutes at 60'C followed by washes in 0.2% 
SDS and distilled water as before. 
20 Hybridization 

Hybridization reactions contain 9 uJ of sample mixture consisting of 0.2 ug each of Cy3 and 
Cy5 .abeled cDNA synthesis products in 5X SSC. 0.2% SDS hybridization buffer. The sample 
m.xture is heated to 65 'C for 5 minutes and is aliquoted onto the microarray surface and covered with 

an LScm-coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a slide . ^ chamber . $ ^ ^ m% ^ ^ ^ 

140 pi of 5X SSC in a comer of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours a, 60°C. The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC 
0.1% SDS), three rimes for ,0 minutes each at 45 'C in a second wash buffer (0.1X SSC), and dried 
Detection 

30 Reporter-labeled hybridization complexes are detected with a microscope equipped with an 

Innova 70 mixed gas 1 0 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon. Inc.. Melville NY). The slide 
contamtng the array is placed on a computer-controlled X-Y stage on the microscope and raster- 

35 scanned past the objective. The ,.8cmx 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
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In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Enutted hght is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters posmoned between the array and the photomultiplier tubes are used to filter the signals The 
5 ^onn^rnaofmenuorophoresuse^areSeSnmforCySandeSOnmforCyS. Eacharrayis 
ryptcally scaled twice, one scan per fluorophore using the appropriate filters at the laser source 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensify generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
10 the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

locanon to be correlated with a weight ratio of hybridizing species of 1:100.000. When two samples 
from drfferent sources (e.g.. representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
15 fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
hnear 20-colcr transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
20 s.gnal). Tne data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
enuss,on spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
-s centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtam a numerical value corresponding to the average intensity of the signal. The software used 
for s>gnal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
K. Complementary Polynucleotides 

Sequences complementary to the TPPT^ncoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring TPPT. Although use of oligonucleotides 
compnsmg from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or w,th larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of TPPT. To inhibit transcription a 
complementary oligonucleotide is designed from the most unique 5'sequence and used to prevent 
promoter bmding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
35 des.gned to prevent ribosomal binding to the TPPT-encoding transcript. 
x - Expression of TPPT 
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Expression and purification of TPPT is achieved using bacterial or virus-based expression 
systems. Forexpressi n of TPPT in bacteria, cDNA is subcloned into an appropriate vector 
contamtng an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcnpnon. Examples of such promoters include, bu, are not limited to, the trp-lac (tac) hybrid 
5 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3) 
Antibiotic resistant bacteria express TPPT upon induction with isopropyl beta-D- 
th.ogalactopyranoside (IPTG). Expression of TPPT in eukaryotic cells is achieved by infecting insect 
or mammalian cell Hnes with recombinant Autographica californin, nuclear polyhedrosis virus 
10 (AcMNPVXcommonlyknownasbaculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TPPT by either homologous recombination or bacterial-mediated 
transposition involving transfer p.asmid intermediates. Viral inactivity is maintained and the strong 
polyhednn promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
- ect Sp^dopterafrB^iperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infecttonofthelatterrequiresadditionalgeneticmc^iificationstobaculov^ (See Engelhard EK 

jwmT mi AC3d ' Sd ' USA 91:3224 " 322?; Sandig> V - et * (1996) Hum - 06116 ^ ' 

In most expression systems, TPPT is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affintty-basedpurificationofrecombinantfusionproteinfromcrudecelllysates. GST a26- 
kdodalton enzyme from Schistosoma iaponjn.m, enables the purification of fusion proteins on 
.rnmob.lized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
TPPT at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffmity 
25 punficatton using commercially avaiiab.e monoclona. and po.yclona. anti-FLAG antibodies (Eastman 
Kodak). 6-H.s, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resms (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995 
HUD.* lOand 16). Purified TPPT obtained by these methods can be used directly in the assays ' 
shown in Examples XI and XV. 

30 XI. Demonstration of TPPT Activity 

TPPT transport activity is assayed by measuring uptake of labeled substrates into Xenopus. 
Jaevjs oocytes. Oocytes at stages V and VI are injected with TPPT mRNA (,0 n g per oocyte) and 
.ncubated for 3 days at ,8'C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaC»„ ImM MgCl, 
ImM Na J HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50ug/ml gentamycm, pH 7.8) to allow expression of 
TPPT. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM KCMmM 
CaC 2 , ImM MgC, 2 , ,0 mM Hepes^ris pH 7.5). Uptake of various substrates (e.g.. amino'acids 
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sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. radiolabeled 
w,th *. fluorescent* labeled with rhodamine, etc.) to the oocyte, After incubating for 30 minutes, 
uptake ,s terminated by washing the oocytes three times in Na'-free medium, measuring the 
incorporated label, and comparing with controls. TPPT activity is proportional to the level of 
5 internalized labeled substrate. 
XII. Functional Assays 

TPPT function is assessed by expressing the sequences encoding TPPT at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
10 include P CMV SPORT plasmid (Life Technologies) and pCR3. 1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10^g of recombinant vector are transiently transited into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 n of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
15 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP- 
Clontech). CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based techntque, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
20 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake- 
alteranons in expression of cell surface and intracellular proteins as measured by reactivity with 
25 specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluotescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cvto™,rv Oxford. New York NY. 

The influence of TPPT on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding TPPT and either CD64 or CD64-GFP. CD64 and CD64- 
30 GFP are expressed on the surface of transfected cells and bind to conserved regions of human 

.mmunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
Expression of mRNA encoding TPPT and other genes of interest can be analyzed by northern analysis 
35 or microarray techn iques. 
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Xm. Production of TPPT Specific Antibodies 

TPPT substantially purified using polyacryiamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-195), or other P urif,cation techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TPPT amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See. e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 
TPPT activity by, for example, binding the peptide or TPPT to a substrate, blocking with 1% BSA, 
reacting with rabbit antisera. washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XIV. Purification of Naturally Occurring TPPT Using Specific Antibodies 

Naturally occurring or recombinant TPPT is substantially purified by immunoaffinity 
chromatography using antibodies specific for TPPT. An immunoaffinity column is constructed by 
covalently coupling anti-TPPT antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TPPT are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of TPPT (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibodwTPPT binding (e.g., a buffer of pH 2 to pH 3. or a high concentration of a chaotrope. such as 
urea or thiocyanate ion), and TPPT is collected. 
XV. Identification of Molecules Which Interact with TPPT 

TPPT, or biologically active fragments thereof, are labeled with '"I Bolton-Hunter reagent. 
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled TPPT, washed, and 
any wells with labeled TPPT complex are assayed. Data obtained using different concentrations of 
TPPT are used to calculate values for the number, affinity, and association of TPPT with the 
candidate molecules. 

Alternatively, molecules interacting with TPPT are analyzed using the yeast two-hybrid 
system as described in Fields. S. and O. Song (1989. Nature 340:245-246). orusing commercially 
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available kits based n the two-hybrid system, such as the MATCHMAKER system (Clontech). 

TPPT may also be used in the PATHCALUNG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S Patent 
No. 6,057,101). 



10 



Various modifications and variations of the described methods and systems of the invention 
ill be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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What is claimed is: 
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1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7. SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13. SEQ ID NO:14, SEQ ID NO: 15, SEQ ID NO:16. 
SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, 
SEQ ID NO:23, SEQ ID NO:24. SEQ ID NO:25, SEQ ID NO:26. SEQ ID NO:27, SEQ ID NO:28, 
10 SEQ ID NO:29, SEQ ID NO:30. SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33. SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41. SEQ ID NO:42, 
and SEQ ID NO:43. 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 

15 NO:3, SEQ ID NO:4. SEQ ID NO:6. SEQ ID NO:7, SEQ ID NO:8. SEQ ID NO:9. SEQ ID NO: 10, 
SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15. SEQ ID NO:16. SEQ ID NO:17, 
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, 
SEQ ID NO:24. SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32. SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 

20 SEQIDNO:36,SEQIDNO:38,SEQIDNO:39,SEQIDNO:41,SEQIDNO:42,andSEQID 
NO:43, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 , SEQ ID NO:2. SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9. SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO: 13, SEQ ID 

25 NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27. SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30. SEQ ID NO:31, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35. SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:41 , SEQ ID NO:42, and SEQ ID NO:43, and 

30 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9. SEQ ID NO: 10. SEQ ID NO: 12. SEQ ID NO: 13. SEQ ID NO: 14. SEQ ID 
NO: 15. SEQ ID NO: 16. SEQ ID NO: 17, SEQ ID NO: 18. SEQ ID NO:19. SEQ ID NO:20. SEQ ID 
NO:21. SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26. SEQ ID 

35 NO:27. SEQ ID NO:28, SEQ ID NO:29. SEQ ID NO:30, SEQ ID NO:31, SEQ IDNO:32. SEQ ID 
NO:33. SEQ ID NO:34. SEQ ID NO:35. SEQ ID NO:36. SEQ ID NO:38, SEQ ID NO:39. SEQ ID 
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NO:41. SEQ ID NO:42, and SEQ ID NO:43. 



PCT/US00/16668 



2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6. SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO: 10. SEQ ID NO:12, SEQ ID NO: 13. SEQ ID NO: 14. SEQ ID NO:15. SEQ ID 
NO:16. SEQ ID NO:17. SEQ ID NO: 18. SEQ ID NO:19. SEQ ID NO:20. SEQ ID NO:21. SEQ ID 
NO:22. SEQ ID NO:23. SEQ ID NO:24. SEQ ID NO:25. SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28. SEQ ID NO:29. SEQ ID NO:30. SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33. SEQ ID 
NO:34. SEQ ID NO:35. SEQ ID NO:36. SEQ ID NO:38, SEQ ID NO:39. SEQ ID NO:41, SEQ ID 
NO:42. and SEQ ID NO:43. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:44. SEQ ID NO:45. SEQ ID NO:46. SEQ ID NO:47. SEQ ID NO:49. SEQ ID NO:50. SEQ ID 
NO:51. SEQ ID NO:52. SEQ ID NO:53. SEQ ID NO:55. SEQ ID NO:56. SEQ ID NO:57. SEQ ID 
NO:58. SEQ ID NO:59. SEQ ID NO:60, SEQ ID NO:61. SEQ ID NO:62, SEQ ID NO:63. SEQ ID 
NO:64, SEQ ID NO:65. SEQ ID NO:66, SEQ ID NO:67. SEQ ID NO:68, SEQ ID NO:69. SEQ ID 
NO:70. SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73. SEQ ID NO:74, SEQ ID NO:75. SEQ ID 
NO:76. SEQ ID NO:77. SEQ ID NO:78, SEQ ID NO:79. SEQ ID NO:81, SEQ ID NO:82, SEQ ID 
NO:84. SEQ ID NO:85. SEQ ID NO:86. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim I, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
U and 
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10. An isolated antibody which specifically binds to a polypeptide f claim 1 . 

5 1 1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the 

group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:5 1, SEQ ID 
NO:52, SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID 
10 NO:59, SEQ ID NO:60, SEQ ID NO:61 . SEQ ID NO:62. SEQ ID NO:63, SEQ ID NO:64, SEQ ID 
NO:65, SEQ ID NO:66. SEQ ID NO:67, SEQ ID NO:68. SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:7 1. SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:81 , SEQ ID NO:82, SEQ ID NO:84, SEQ ID 
NO:85, SEQ ID NO:86, 

15 b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID NO:45, SEQ 
ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID 
NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 

20 NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 
NO:86, 

c) a polynucleotide sequence complementary to a), 
25 d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 



12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and. optionally, if 
present, the amount thereof. 



14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1. the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutically acceptable excipient. 

17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID 
NO.12. SEQ ID NO: 1 3. SEQ ID NO: 14. SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17. SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23. SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26. SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29. SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32. SEQ ID NO:33. SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:38. SEQ ID NO:39, SEQ ID NO:41 , SEQ ID NO:42, and SEQ ID NO:43. 

18. A method for treating a disease or condition associated with decreased expression of 
functional TPPT, comprising administering to a patient in need of such treatment the pharmaceutical 
composition of claim 16. 

1 9. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A pharmaceutical composition comprising an agonist compound identified by a method 
of claim 19 and a pharmaceutically acceptable excipient. 
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21 . A method for treating a disease or condition associated with decreased expression of 
functional TPPT, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 20. 



5 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim I to a compound, and 

b) detecting antagonist activity in the sample. 

10 23. A pharmaceutical composition comprising an antagonist compound identified by a 

method of claim 22 and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
TPPT, comprising administering to a patient in need of such treatment a pharmaceutical composition 
15 of claim 23. 



25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
20 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
25 claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

30 c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 

with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 

27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

103 



35 



o 



WO 00/78953 PCT/US00/16668 

comprising: 

a) exp sing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

28. An isolated polynucleotide comprising a polynucleotide sequence of SEQ ID NO:83. 

29. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 28. 

30. A cell transformed with a recombinant polynucleotide of claim 29. 

3 1 . A transgenic organism comprising a recombinant polynucleotide of claim 29. 

32. A method for producing a polypeptide comprising an amino acid sequence of SEQ ID 
NO:40, the method comprising: 

a) culturing the cell of claim 30 under conditions suitable for expression of the polypeptide, 

and 

b) recovering the polypeptide so expressed. 

33. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 28, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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<110> INCYTE GENOMICS, INC. 
LAL, Preeti 
YANG, Junraing 
YUE, Henry 
HILLMAN, Jennifer L 
TANG, Y. Tom 
BANDMAN, Olga 
BURFORD, Neil 
BAUGHN, Marian R. 
AZIMZAI, Yalda 
LU, Dyung Aina M. 
AU-YOUNG, Janice 
PATTERSON , Chandra 

<120> HUMAN TRANSPORT PROTEINS 

<130> PF-0709 PCT 

<140> To Be Assigned 
<141> Herewith 



<150> 60/139,923; 60/148,177; 60/149 357- 60/169 ?R7 
<151> 1999-06-17; 1999-08-1 0 ; ImSSsS; 1999-^-28 
<160> 86 

<170> PERL Program 

<210> 1 
<211> 623 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> raisc_f eature 

<223> Incyte ID No: 264114CD1 

<400> 1 

Met Ser Thr Gin Asp Glu Arg Gin He Asn Thr Glu Tyr Ala Val 
Ser Leu Leu Glu Gin Leu Lys Leu Phe TyJ Glu Gin Gin Leu Pne 



Thr Asp lie Val Leu He Val Glu Gly Tta Glu Phe Pro Cys Hil 



30 



35 



Lys Met Val Leu Ala Thr Cys Ser Ser Tyr Phe Arg Ala Met Pne 

Met Ser Gly Leu Ser Glu Ser Lys Gin Thr His Val His Leu Arg 

70 75 
tie He Thr Tyr Ala Tyr 

Thr Gly Asn Leu Ala Met Asn Asp Ser Tnr Val Glu Gin Leu Ty? 
Glu Thr Ala Cys Phe Leu Gin Val Glu Asp Val Leu Gin Arg lys 
Arg Glu Tyr Leu lie Lys Lys He Asn 111 Glu Asn Cys Val Arg 



Asn Val Asp Ala Ala Thr Leu Gin lie lie lie Thr Tyr Ala Tyl 



Leu Leu Ser Phe Ala Asp Leu Phe Ser Cys Glu Glu Leu Lys lln 



140 



Ser Ala Lys Arg Met Val Glu His Lys Phe Thr Ala Val Tyr HI 



Gin Asp Ala Phe Met Gin Leu Ser His 1% Leu Leu He Asp He 



170 



Leu Ser Ser Asp Asn Leu Asn Val Glu Lys Glu Glu Thr Val A? g 



Glu Ala Ala Met Leu Trp Leu Glu Tyr Asn Thr Glu Ser Arg III 
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Asp Lys Ser Val Val Val Gin Gly Leu Tyr Lys Ser Met Pro Lys 

250 25S 

Phe Phe Lys Pro Arg Leu Gly Met Thr Lys Glu Glu Met Met lie 

265 270 

Phe He Glu Ala Ser Ser Glu Asn Pro Cys ser Leu Tyr Ser Ser 

Val Cys Tyr Ser Pro Gin Ala Glu Lys VaS Tyr Lys Leu Cys III 

295 300 
Pro Pro Ala Asp Leu His Lys Val Gly Thr Val Val Thr Pro Asp 

305 310 iic 

Asn Asp He Tyr lie Ala Gly Gly Gin Val Pro Leu Lys Asn Thr 

320 325 ->-i ft 

Lys Thr Asn His Ser Lys Thr Ser Lys Leu Gin Thr Ala Phe Arg 

~* 340 "*d5 

Thr Val Asn Cys Phe Tyr Trp Phe Asp Ala Gin Gin Asn Thr Trp 

350 355 ^rft 

Phe Pro Lys Thr Pro Met Leu Phe Val Arg lie Lys Pro Ser Leu 

3o5 370 

Val Cys cys Glu Gly Tyr lie Tyr Ala lie Gly Gly Asp Ser Val 

Gly Gly Glu Leu Asn Arg Arg Thr Val III Arg Tyr Asp Thr llu 

3°5 400 405 

Lys Asp Glu Trp Thr Met Val Ser Pro Leu Pro Cys Ala Trp Gin 

410 4^5 

Trp ser Ala Ala Val Val Val His Asp Cys He Tyr Val Met Tnr 

425 430 4^5 

Leu Asn Leu Met Tyr Cys Tyr Phe Pro Arg Ser Asp Ser Trp Val 

440 445 * 4 ca 

Glu Met Ala Met Arg Gin Thr Ser Arg Ser Phe Ala Ser Ala Ala 

Ala Phe Gly Asp Lys lie Phe Tyr He ily G ly Leu His He ill 
470 475 - nn 

Thr Asn Ser Gly lie Arg Leu Pro Ser Gly Thr Val Asp Gly Ser 
4 " 490 /qc 

Ser val Thr Val Glu He Tyr Asp Val Asn Lys Asn Glu Trp Lys 
500 505 

Met Ala Ala Asn lie Pro Ala Lys Arg Tyr Ser Asp Pro Cys Val 

515 520 cor 

Arg Ala Val Val lie Ser Asn Ser Leu Cys Val Phe Met Arg Glu 

® 535 54n 

Thr His Leu Asn Glu Arg Ala Lys Tyr Val Thr Tyr Gin Tyr Asp 

545 550 ccc 

Leu Glu Leu Asp Arg Trp Ser Leu Arg Gin His He Ser Glu Arg 

t/=n t m 560 565 570 

Val Leu Trp Asp Leu Gly Arg Asp Phe Arg Cys Thr Val Gly Lys 

r ™. 575 580 SflS 

Leu Tyr Pro Ser Cys Leu Glu Glu Ser Pro Trp Lys Pro Pro Thr 

590 595 £ nn 

Tyr Leu Phe Ser Thr Asp Gly Thr Glu Glu Phe Glu Leu Asp Gly 

Glu Met Val Ala Leu Pro Pro Val 61 ° 615 

620 

<210> 2 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1455669CD1 

<400> 2 

Met Ala Ala Pro Ala Glu Pro Cys Ala Gly Gin Gly Val Trp Asn 
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15 10 15 

Gin Thr Glu Pro Glu Pro Ala Ala Thr Ser Leu Leu Ser Leu Cys 

20 25 30 

Phe Leu Arg Thr Ala Gly Val Trp Val Pro Pro Met Tyr Leu Trp 

35 40 45 

Val Leu Gly Pro He Tyr Leu Leu Phe He His His His Gly Arg 

50 55 6 q 

Gly Tyr Leu Arg Met Ser Pro Leu Phe Lys Ala Lys Met Val Ala 

65 70 75 

Ala He Pro Gly Ser Leu Glu Pro Gly Asn Val Arg Gly Arg Gin 

80 85 qn 

Gly Thr Gly Trp Asn Leu Val Lys Ser 

95 

<210> 3 
<211> 374 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2084989CD1 

<400> 3 

Met Glu Ser Lys Met Gly Glu Leu Pro Leu Asp lie Asn He Gin 
1 5 10 15 

Glu Pro Arg Trp Asp Gin Ser Thr Phe Leu Gly Arg Ala Arg His 

Phe Phe Thr Val Thr Asp Pro Arg Asn Leu Leu Leu Ser Gly Ala 
35 40 45 

Gin Leu Glu Ala Ser Arg Asn He Val Gin Asn Tyr Arg Ala Gly 

Val Val Thr Pro Gly H e Thr Glu Asp Gin Leu Trp Arg Ala hyt 
65 70 75 

Tyr Val Tyr Asp Ser Ala Phe His Pro Asp Thr Gly Glu Lys Val 
80 85 90 

Val Leu lie Gly Arg Met Ser Ala Gin Val Pro Met Asn Met Thr 
95 100 105 

He Thr Gly Cys Met Leu Thr Phe Tyr Arg Lys Thr Pro Thr Val 
110 115 12 o 

Val Phe Trp Gin Trp Val Asn Gin Ser Phe Asn Ala He Val Asn 
125 130 135 

Tyr Ser Asn Arg Ser Gly Asp Thr Pro He Thr Val Arg Gin Leu 
140 145 15 q 

Gly Thr Ala Tyr Val Ser Ala Thr Thr Gly Ala Val Ala Thr Ala 
155 160 165 

Leu Gly Leu Lys Ser Leu Thr Lys His Leu Pro Pro Leu Val Gly 
170 175 180 

Arg Phe Val Pro Phe Ala Ala Val Ala Ala Ala Asn Cys lie Asn 
185 190 195 

He Pro Leu Met Arg Gin Arg Glu Leu Gin Val Gly He Pro Val 
200 205 210 

Ala Asp Glu Ala Gly Gin Arg Leu Gly Tyr Ser Val Thr Ala Ala 
215 220 225 

Lys Gin Gly lie Phe Gin Val Val He Ser Arg lie Cys Met Ala 
230 235 240 

He Pro Ala Met Ala lie Pro Pro Leu lie Met Asp Thr Leu Glu 
245 250 255 

Lys Lys Asp Phe Leu Lys Val Gly Asp Cys Thr Ser Leu Val Leu 
260 265 270 

Glu Trp Ala Met Ala Gly Arg Ser Asp Gin Ala Pro Thr Leu Ser 
275 280 285 

Pro Ala Ser Pro Asp Ser Leu Arg Leu Ala Ser Pro Ser Pro Asp 
290 295 300 

Pro Cys Thr Ala Ser Ser Thr Phe Val His Ser Ala Arg Met Asn 
305 310 315 

Trp Ala Gly Val Lys Glu Leu Cys Arg Gly Arg Arg Arg Gly Gin 
320 325 33A 

Arg Lys Glu Thr Asn Phe He Ser Val Thr Pro Val Ala Ser Asp 
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335 340 -\ac 

Thr Gin Lys Gly Thr Val lie Val Met Leu Asp Leu Met Leu lie 

350 355 3g0 

Leu Leu Pro Pro Ser Ala Ser He Leu Arg Gly Thr His Gly 

365 370 

<210> 4 
<211> 271 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2501034CD1 

<400> 4 

Met Gly Asn Gly Gly Arg Ser Gly Leu Gin Gin Gly Lys Gly Asn 

Val Asp Gly Val Ala Ala Thr Pro Thr All Ala Ser Ala Ser cJs 

20 25 30 

Gin Tyr Arg Cys lie Glu Cys Asn Gin Glu Ala Lys Glu Leu Tyr 

35 40 45 

Arg Asp Tyr Asn His Gly Val Leu Lys He Thr He Cys Lys Ser 

50 55 * 

Cys Gin Lys Pro Val Asp Lys Tyr He Glu Tyr Asp Pro Val lie 

65 70 7c 

He Leu He Asn Ala He Leu Cys Lys Ala Gin Ala Tyr Arg His 

He Leu Phe Asn Thr Gin He Asn He His Gly Lys Leu Cys lie 

Phe Cys Leu Leu Cys Glu Ala Tyr Leu Jrg Trp Trp Gin Leu Gin 
110 115 120 

Asp Ser Asn Gin Asn Thr Ala Pro Asp Asp Leu He Arg Tyr Ala 

Lys Glu Trp Asp Phe Tyr Arg Met Phe Ala He Ala Ala Leu Glu 

140 145 ^cq 

Gin Thr Ala Tyr Phe He Gly He Phe Thr Phe Leu Trp Val Glu 

155 160 ifit; 

Arg Pro Met Thr Ala Lys Lys Lys Pro Asn Phe He Leu Leu Leu 
0 175 180 

Lys Ala Leu Leu Leu Ser Ser Tyr Gly Lys Leu Leu Leu He Pro 

190 195 
Ala Val He Trp Glu His Asp Tyr Thr Ser Val Cys Leu Lys Leu 

200 205 ?in 

He Lys Val Phe Val Leu Thr Ser Asn Phe Gin Ala He Arg Val 

215 220 9?S 

Thr Leu Asn He Asn Arg Lys Leu Ser Phe Leu Ala Val Leu Ser 

235 240 
Gly Leu Leu Leu Glu Ser He Met Val Tyr Phe Phe Gin Ser Met 

245 250 
Glu Trp Asp Val Gly Ser Asp Tyr Ala lie Phe Lys Ser Gin Asp 

Phe 260 265 270 

<210> 5 

<211> 323 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2745212CD1 

<400> 5 

Met Ala Pro Lys Gin Asp Pro Lys Pro Lys Phe Gin Glu Gly Glu 

Arg Val Leu Cys Phe His Gly Pro Leu LeJ Tyr Glu Ala Lys C^s 

20 25 
Val Lys Val Ala He Lys Asp Lys Gin Val Lys Tyr Phe He His 
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35 4Q 
TVr Ser Gly Trp Asn Lys Asn Trp Asp Glu Trp Val Pro Glu Ser 
50 55 60 

Leu Gin Lys Gin Arg Glu 

Leu Gin Lys Ala A^n Gin Glu Gin Tyr Ala Glu Gly Lys Met Arg" 
80 85 9 5 

ly Leu Gin Gin Lys Asn 
100 105 



Arg val Leu Lys Tyr Val Asp Thr Asn Leu Gin Lys Gin Arg Gil 

80 0 _, 
Gly Ala Ala Pro Gly Lys Lys Thr Ser Gly Leu Gin Gin Lys Asn 

Val Glu Val Lys Thr Lys Lys Asn Lys 111 Lys Thr Pro Gly Asn 



Gly Asp Gly Gly Ser Thr Ser Glu Thr Pro Gin Pro Pro Arg Lys 
" 130 iir 

Lys Arg Ala Arg yal Asp Pro Thr Val Glu Asn Glu Glu Thr Phe 
140 145 150 



Met Asn Arg Val Glu Val Lys Val Lys Ila Pro Glu Glu Leu Lys 

Asp 
170 
Pro 

185 ±yu 
Asp Tyr Ala Asn Tyr Lys Lys Ser Arg Gly Asn Thr Asp Asn lys 

Glu Tyr Ala Val Asn Glu Val Val Ala Gly lie Lys Glu Tyr III 
215 220 225 



j-j^j lfio ilc 

Pro Trp Leu Val Asp Asp Trp Asp Leu lie Thr Arg Gin Lys Gil 
1/0 175 1Qn 

Leu Phe Tyr Leu Pro Ala Lys Lys Asn Val Asp Ser lie Leu Gil 



Asn Val Met Leu Gly Thr Gin Leu Leu Tyr Lys Phe Glu Arg Pro 

Leu Ala Asp His 
Gin Val Tyr Gly Ala Pro His Leu Leu Arg Leu Phe Val Arg lie 



^ju 235 o a r\ 

Gin Tyr Ala Glu lie Leu Ala Asp His Pro Asp Ala Pro Met Ser 



245 

Ala 

260 ™ 265 

Gly Ala Met Leu Ala Tyr Thr Pro Leu Asp Glu Lys Ser Leu All 
I 75 . 280 285 

Leu Lys Tyr Leu Ala Lys 
295 300 
r«e aer Ala ser Asp Tyr Glu Val Ala Pro 

Pro Glu Tyr His Arg Lys Ala Val 31 ° 315 

320 

<210> 6 
<211> 274 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4833111CD1 

<400> 6 

Met Asp Arg Pro Gly Phe Val Ala Ala Leu Val Ala Gly Gly Val 

Ala Gly val Ser Val Asp Leu lie Leu Pne Pro Leu Asp Thr Al 

Lys Thr Arg Leu Gin Ser Pro Gin Gly Phi Ser Lys Ala Gly oly 
Jd 40 4 c 

Phe His Gly He Tyr Ala Gly Val Pro Ser Ala Ala lie Gly Ser 

Phe Pro Asn Ala Ala Ala Phe Phe lie Thr Tyr Glu Tyr Val Lys 

Trp Phe Leu His Ala Asp Ser Ser Ser Tyr Leu Thr Pro Met Lys 

His Met Leu Ala Ala Ser Ala Gly Glu vS Val Ala Cys Leu ill 

Arg val Pro Ser Glu Val Val Lys Gin Ar^ Ala Gin Val Ser 111 

Ser Thr Arg Thr Phe Gin lie Phe Ser HI lie Leu Tyr Glu HI 

130 i-aq 
Gly lie Gin Gly Leu Tyr Arg Gly Tyr Lys Ser Thr Val Leu Arg 



Leu Leu Leu Asn Tyr Leu His Asp Phe Leu Lys Tyr Leu Ala Lys 

zyU 295 -ann 

Asn Ser Ala Thr Leu Phe Ser Ala Ser Asp Tyr Glu Val Ala Pro 
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140 145 150 

Glu lie Pro Phe Ser Leu Val Gin Phe Pro Leu Trp Glu Ser Leu 

155 160 165 

Lys Ala Leu Trp Ser Trp Arg Gin Asp His Val Val Asp Ser Trp 

170 175 ln JT 

Gin Ser Ala Val Cys Gly Ala Phe Ala Gly Gly Phe Ala Ala Ala 

185 190 ioc 

Val Thr Thr Pro Leu Asp Val Ala Lys Thr Arg He Thr Leu Ala 

200 205 510 

Lys Ala Gly Ser Ser Thr Ala Asp Gly Asn Val Leu Ser Val Leu 

215 220 225 

His Gly Val Trp Arg Ser Gin Gly Leu Ala Gly Leu Phe Ala Gly 

230 235 040 

Val Phe Pro Arg Met Ala Ala He Ser Leu Gly Gly Phe He Phe 

245 250 255 

Leu Gly Ala Tyr Asp Arg Thr His Ser Leu Leu Leu Glu Val Gly 

260 265 97ft 

Arg Lys Ser Pro u 

<210> 7 

<211> 291 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 87 6677CD1 

<400> 7 

Met Asp Ser Arg Val Ser Ser Pro Glu Lys Gin Asp Lys Glu Asn 

Phe Val Gly Val Asn Asn Lys Arg Leu Gly Val Cys Gly Trp lie 

20 25 30 

Leu Phe Ser Leu Ser Phe Leu Leu Val He n e Thr Phe Pro lie 

35 40 ac 

Ser lie Trp Met Cys Leu Lys lie lie Lys Glu Tyr Glu Arg Ala 

50 55 " 

Val Val Phe Arg Leu Gly Arg He Gin Ala Asp Lys Ala Lys Gly 

65 70 7s 

Pro Gly Leu He Leu Val Leu Pro Cys lie Asp Val Phe Val Lys 

80 85 90 

Val Asp Leu Arg Thr Val Thr Cys Asn He Pro Pro Gin Glu lie 
t 95 3-00 105 

Leu Thr Arg Asp Ser Val Thr Thr Gin Val Asp Gly Val Val Tyr 
110 115 i?n 

Tyr Arg He Tyr Ser Ala Val Ser Ala Val Ala Asn Val Asn Asp 
125 130 n c 

Val His Gin Ala Thr Phe Leu Leu Ala Gin Thr Thr Leu Arg Asn 
140 145 ica 

Val Leu Gly Thr Gin Thr Leu Ser Gin He Leu Ala Gly Arg Glu 

Glu He Ala His Ser He Gin Thr Leu JeJ Asp Asp Ala Thr Glu 

170 ]_75 lao 

Leu Trp Gly He Arg Val Ala Arg Val Glu He Lys Asp Val Arg 

185 190 105 

He Pro Val Gin Leu Gin Arg Ser Met Ala Ala Glu Ala Glu Ala 

200 205 210 

Thr Arg Glu Ala Arg Ala Lys Val Leu Ala Ala Glu Gly Glu Met 

220 22S 
Asn Ala Ser Lys Ser Leu Lys Ser Ala Ser Met Val Leu Ala Glu 

230 235 240 

Ser Pro He Ala Leu Gin Leu Arg Tyr Leu Gin Thr Leu Ser Thr 

245 250 ? cr 

Val Ala Thr Glu Lys Asn Ser Thr He Val Phe Pro Leu Pro Met 

260 265 ?7n 

Asn He Leu Glu Gly He Gly Gly Val Ser Tyr Asp Asn His Lys 

275 280 9A5 

Lys Leu Pro Asn Lys Ala 
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290 

<210> 8 
<211> 381 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2326143CD1 

<400> 8 

Met Ser Arg His Glu Gly Val Ser Cys Asp Ala Cys Leu Lys Gly 

Asn Phe Arg Gly Arg Arg Tyr Lys Cys Leu lie Cys Tyr Asp T^r 
20 25 30 

Asp Leu Cys Ala Ser Cys Tyr Glu Ser Gly Ala Thr Thr Thr Arg 

" 40 
His Thr Thr Asp His Pro Met Gin Cys He Leu Thr Arg Val Asp 
50 , 55 eo 

yr Gly Gly Glu Ala Phe Ser Val Glu Gin P 
65 70 75 

ys Pro Tyr Cys Gly Lys Met Gly Tyr Thr G 

?° . 8 5 90 

lu His Val Thr Ser Glu His Ala Glu Thr S 
95 100 105 

ys Pro He Cys Ala Ala Leu Pro Gly Gly 
110 115 120 

Thr Asp Asp Phe Ala Ala His Leu Thr Leu 
125 130 135 

Arg Asp Leu Asp Glu Ser Ser Gly Val Arg 
140 145 150 



Phe Asp Leu Tyr Tyr Gly Gly Glu Ala Phe Ser Val Glu Gin Pro 

70 75 

Gin Ser Phe Thr Cys Pro Tyr Cys Gly Lys Met Gly Tyr Thr Glu 

85 Qf\ 

Thr Ser Leu Gin Glu His Val Thr Ser Glu His Ala Glu Thr Ser 

Thr Glu Val lie Cys Pro He Cys Ala Ma Leu Pro Gly Gly 111 
HO 115 ion 

Pro Asn His val Thr Asp Asp Phe Ala Ala His Leu Thr Leu Glu 

u- » 25 130 135 

Has Arg Ala Pro Arg Asp Leu Asp Glu Ser Ser Gly Val Arg His 

145 150 
Val Arg Arg Met Phe His Pro Gly Arg Gly Leu Gly Gly Pro Arg 

» n , * 155 160 165 

Ala Arg Arg Ser Asn Met His Phe Thr Ser Ser Ser Thr Gly Gly 

170 175 ' in i 

Leu Ser Ser Ser Gin Ser Ser Tyr Ser Pro Ser Asn Arg Glu Ala 
„ 185 190 ins 

Met Asp Pro He Ala Glu Leu Leu Ser Gin Leu Ser Gly Val Arg 

200 205 ?in 

Arg Ser Ala Gly Gly Gin Leu Asn Ser Ser Gly Pro Ser Ala Ser 

215 220 225 

Gin Leu Gin Gin Leu Gin Met Gin Leu Gin Leu Glu Arg Gin His 

235 240 
Ala Gin Ala Ala Arg Gin Gin Leu Glu Thr Ala Arg Asn Ala Thr 
* 250 255 

Arg Arg Thr Asn Thr Ser Ser Val Thr Thr Thr lie Thr Gin Ser 

260 265 97n 

Thr Ala Thr Thr Asn lie Ala Asn Thr Glu Ser Ser Gin Gin Thr 

275 280 285 

Leu Gin Asn Ser Gin Phe Leu Leu Thr Arg Leu Asn Asp Pro Lys 

290 295 -inn 

Met Ser Glu Thr Glu Arg Gin Ser Met Glu Ser Glu Arg Ala Asp 

305 310 
Arg Ser Leu Phe Val Gin Glu Leu Leu Leu Ser Thr Leu Val Arg 

Glu Glu Ser Ser Ser Ser Asp Glu Asp Asp Arg Gly Glu Met Ala 
335 340 345 



Asp Phe Gly Ala Met Gly Cys Val Asp lie Met Pro Leu Asp Val 

Ala Leu Glu Asn Leu Asn Leu Lys Glu III Asn Lys Gly Asn Glu 

3°5 370 17 r 

Pro Pro Pro Pro Pro Leu 
380 

<210> 9 

<211> 190 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 2786302CD1 

<400> 9 

Met Lys Tyr Gly Asn Glu lie Met Asn Lys Asp Pro Val Phe Arg 

, 5 10 15 

lie Ser Pro Arg Ser Arg Glu Thr His Pro Asn Pro Glu Glu Pro 

20 25 30 

Glu Glu Glu Asp Glu Asp Val Gin Ala Glu Arg Val Gin Ala Ala 

35 40 45 

Asn Ala Leu Thr Ala Pro Asn Leu Glu Glu Glu Pro Val He Thr 

*i o ^ 50 55 60 

Ala Ser Cys Leu His Lys Glu Tyr Tyr Glu Thr Lys Lys Ser Cys 
65 70 75 

Phe Ser Thr Arg Lys Lys Lys He Ala He Arg Asn Val Ser Phe 
80 85 90 

Cys Val Lys Lys Gly Glu Val Leu Gly Leu Leu Gly His Asn Gly 
95 100 105 

Ala Gly Lys Ser Thr Ser He Lys Met He Thr Gly Cys Thr Lys 

110 115 120 

Pro Thr Ala Gly Val Val Val Leu Gin Gly Ser Arg Ala Ser Val 

125 130 135 

Arg Gin Gin His Asp Asn Ser Leu Lys Phe Leu Gly Tyr Cys Pro 

140 145 
Gin Glu Asn Ser Leu Trp Pro Lys Leu Thr Met Lys Glu His Leu 

155 160 165 

Glu Leu Tyr Ala Ala Val Glu Arg Leu Gly Gin Lys Arg Cys Cys 

170 175 180 

Ser Gin Tyr Phe Thr He Gly Gly Arg Ser 

185 iso 

<210> 10 
<211> 297 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3735780CD1 

<400> 10 

Met Met Asp Ser Glu Ala His Glu Lys Arg Pro Pro He Leu Thr 
5 10 15 

Ser Ser Lys Gin Asp He Ser Pro His He Thr Asn Val Gly Glu 

20 25 30 

Met Lys His Tyr Leu Cys Gly Cys Cys Ala Ala Phe Asn Asn Val 

35 40 45 

Ala He Thr Phe Pro He Gin Lys Val Leu Phe Arg Gin Gin Leu 

50 55 60 

Tyr Gly He Lys Thr Arg Asp Ala He Leu Gin Leu Arg Arg Asp 

65 70 75 

Gly Phe Arg Asn Leu Tyr Arg Gly He Leu Pro Pro Leu Met Gin 

80 85 90 

Lys Thr Thr Thr Leu Ala Leu Met Phe Gly Leu Tyr Glu Asp Leu 

95 100 105 

Ser cys Leu Leu His Lys His Val Ser Ala Pro Glu Phe Ala Thr 
110 us 12 Q 

Ser Gly Val Ala Ala Val Leu Ala Gly Thr Thr Glu Ala He Phe 
„ „ 125 130 135 

Thr Pro Leu Glu Arg Val Gin Thr Leu Leu Gin Asp His Lys His 
„■ , 140 I" 150 

His Asp Lys Phe Thr Asn Thr Tyr Gin Ala Phe Lys Ala Leu Lys 
i55 160 165 

Cys His Gly He Gly Glu Tyr Tyr Arg Gly Leu Val Pro He Leu 

1^0 175 
Phe Arg Asn Gly Leu Ser Asn Val Leu Phe Phe Gly Leu Arg Gly 
185 190 195 

Pro He Lys Glu His Leu Pro Thr Ala Thr Thr His Ser Ala His 
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Leu val Asn Asp Phe lie Cys Gly Gly Leu Leu Gly Ala Met Leu 

220 225 
Gly Phe Leu Phe Phe Pro He Asn Val Val Lys Thr Arg lie Gin 



200 205 210 

Leu Leu Gly Ala Met Leu 
220 225 
Val Lys Thr Arg He Gin 

Ser Gin lie Gly Gly Glu Phe Gin Ser Phe Pro Lys Val Phe G Jn 

250 255 
Lys lie Trp Leu Glu Arg Asp Arg Lys Leu lie Asn Leu Phe Arg 

60 265 270 

Gly Ala His Leu Asn Tyr His Arg Ser Leu lie Ser Trp Gly lie 
Tl 275 280 285 

lie Asn Ala Thr Tyr Glu Phe Leu Leu Lys Val He 

290 295 
<210> 11 ^ 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 039026CD1 

<400> 11 

Met Ala Ala Gin lie Pro Glu Ser Asp Gin lie Lys Gin Phe Lys 
Glu Phe Leu Gly Thr Tyr Asn Lys Leu Thr Glu Thr Cys Phe Leu 

Asp Cys Val Lys Asp Phe Thr Thr Arg Glu Val Lys Pro Glu GlS 

35 40 .j. 

Thr Thr Cys Ser Glu His Cys Leu Gin Lys Tyr Leu Lys Met Thr 

50 55 6Q 

Gin Arg He Ser Met Arg Phe Gin Glu Tyr His He Gin Gin Asn 

65 70 -c 

Glu Ala Leu Ala Ala Lys Ala Gly Leu Leu Gly Gin Pro Arg 

80 pc 
<210> 12 " 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 260607CD1 

<400> 12 

Met Ala Leu lie Pro Ser Arg Val Trp Leu Pro Phe Ala Val Trp 

Val Val Asp Ser Ala Pro Val Arg Gly Leu" Val Arg Arg Glu Pro 
?° 25 30 

?ro 

Leu Leu Pro Val Leu lie Asn Leu Phe Ser Phe Phe Leu Thr Pro 
50 55 fiQ 

Ser Phe Trp Arg Gin Leu Gly Ala lie Leu Val Tyr Ala Ser LeU 
65 70 7 _ 

Leu Ala Glu Lys Thr Pro Phe Lys Thr Gin Arg Thr Leu Glu Gly 
80 85 on 

Asp Ala Leu Val Gly Ser Val Ser lie Phe Leu Cys Ala Lys Asp 

Arg Gin Thr Glu Ala Glu Arg Gly Cys Ser 105 
110 115 

<210> 13 
<211> 675 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 



Phe Leu Arg Thr Gly Ser Phe He Ala Leu Phe Tyr Phe Pro Pro 
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<223> Incyte ID No: 1429651CD1 
<400> 13 

Met Glu Ser Gly Thr Ser Ser Pro Gin Pro Pro Gin Leu Asp Pro 
Leu Asp Ala Phe Pro Gin Lys Gly Leu Glu Pro Gly Asp He All 

Val Leu Val Leu Tyr Phe Leu Phe Val Leu Ala Val Gly Leu TrS 

35 4Q 4 £ 

Ser Thr Val Lys Thr Lys Arg Asp Thr Val Lys Gly Tyr Phe Leu 

50 55 60 

Ala Gly Gly Asp Met Val Trp Trp Pro Val Gly Ala Ser Leu Phe 

65 70 7c 

Ala Ser Asn Val Gly Ser Gly His Phe lie Gly Leu Ala Gly Ser 

80 g5 Q rt 

Gly Ala Ala Thr Gly He Ser Val Ser Ala Tyr Glu Leu Asn Gly 

Leu Phe Ser Val Leu Met Leu Ala Trp III Phe Leu Pro lie Tyr 

110 H5 1 i n 

lie Ala Gly Gin Val Thr Thr Met Pro Glu Tyr Leu Arg Lys Arg 

125 130 -1-sc 

Phe Gly Gly lie Arg He Pro lie lie Leu Ala Val Leu Tyr Leu 

140 145 jcn 

Phe He Tyr He Phe Thr Lys He Ser Val Asp Met Tyr Ala Gly 



PCT/US0 0/1 6668 



Met Leu He Gly Ala Leu Thr Leu Met Gly Tyr Ser Phe Ala Ala 

215 220 
Val Gly Gly Met Glu Gly Leu Lys Glu Lys Tyr Phe Leu Ala Leu 
^ 235 240 

Ala Ser Asn Arg Ser Glu Asn Ser Ser Cys Gly Leu Pro Arg Glu 
245 250 tcc 

Asp Ala Phe His lie Phe Arg Asp Pro Leu Thr Ser Asp Leu Pro 
260 265 97n 

Trp Pro Gly Val Leu Phe Gly Met Ser He Pro Ser Leu Trp Tyr 

275 280 
Trp cys Thr Asp Gin Val lie Val Gin Arg Thr Leu Ala Ala Lys 
290 295 ^nn 

Asn Leu Ser His Ala Lys Gly Gly Ala Leu Met Ala Ala Tyr Leu 
* 310 315 

Lys Val Leu Pro Leu Phe He Met Val Phe Pro Gly Met Val Ser 
320 325 33Q 

Arg He Leu Phe Pro Asp Gin Val Ala Cys Ala Asp Pro Glu He 

^35 340 
Cys Gin Lys He Cys Ser Asn Pro Ser Gly Cys Ser Asp He Ala 
350 355 36Q 

Tyr Pro Lys Leu Val Leu Glu Leu Leu Pro Thr Gly Leu Arg Gly 
365 370 o 7 c 

Leu Met Met Ala Val Met Val Ala Ala Leu Met Ser Ser Leu Thr 



380 3 85 



Ser He Phe Asn Ser Ala Ser Thr He Phe Thr Met Asp Leu TrJ 



395 



Asn His Leu Arg Pro Arg Ala Ser Glu tyl Glu Leu Met lie Val 

Gly Arg Val Phe Val Leu Leu Leu Val til Val Ser He Leu 

425 430 j-ac 

He Pro Val Val Gin Ala Ser Gin Gly Gly Gin Leu Phe He Tyr 
440 445 .i Q 

He Gin Ser He Ser Ser Tyr Leu Gin Pro Pro Val Ala Val Val 

Phe He Met Gly Cys Phe Trp Lys Arg Thr Asn Glu Lys Gly 111 
470 475 4flrt 

Phe Trp Gly Leu lie Ser Gly Leu Leu Leu Gly Leu Val Arg Leu 
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Gin 




490 








495 


Pro 


Arg 


Cys 


Asp Gin 


Pro 


Asp 






CAE 








510 


Ser 


lie 


His 


Tyr 


Leu Tyr 


Phe 


Ser 






c *i n 








525 


Leu 


Tl o 

ixe 


inr 


val 


Ser Thr 


Val 


Ser 






c-jc 
JJJ 








540 


Lys 


V3.LU 


Met 


Val 


Ser His 


Leu 


Thr 


Val 




550 








555 


Val 


Gin 


Lys 


Glu Gin 


Ala 


Pro 


Thr 




565 








570 


Leu 


Ser 


Gin 


Asn Gly Met 


Pro 


Val 




580 








585 


Gin 


Phe 


Glu 


Met Val 


Gin 


Glu 


Cys 




595 








600 


Asp 


Met 


Thr 


Pro Lys 


Gin 


Ser 






610 






615 


Trp Leu Cys Gly He Gin Glu 


Lys 






625 








630 


Arg 


Ala 


Glu 


Ala 


He He 


Val 


Ser 






640 








645 


Lys 


Thr 


Leu 


Leu 


Asp Val 


Asn 


Leu 


He 




655 






660 


Phe 


He 


Trp Gly Tyr 


Phe 


Ala 






670 








675 
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485 

Val Leu Asp Phe He Tyr i 
500 

Glu Arg Pro Val Leu Val ] 
515 

Met He Leu Ser Thr Val r < 
530 

Trp Phe Thr Glu Pro Pro I 
545 

Trp Phe Thr Arg His Asp I 
560 

Pro Ala Ala Pro Leu Ser I 
575 

Glu Ala Ser Ser Ser Ser £ 
590 

Asn Thr Ser Lys Thr His £ 
605 

Lys Val Val Lys Ala He I 
620 

Gly Lys Glu Glu Leu Pro P 
635 

Leu Glu Glu Asn Pro Leu \ 
650 

He Phe Cys Val Ser Cys a 
665 

<210> 14 
<211> 320 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 2069971CD1 

<400> 14 

Met Tyr His Cys His Ser Gly Ser Lys Pro Thr Glu Lys Gly Ala 
5 10 is 

Asn Glu Tyr Ala Tyr Ala Lys Trp Lys Leu Cys Ser Ala Ser Ala 
20 25 30 

He Cys Phe He Phe Met He Ala Glu Val Val Gly Gly His He 
35 40 45 

Ala Gly Ser Leu Ala Val Val Thr Asp Ala Ala His Leu Leu He 
50 55 60 

Asp Leu Thr Ser Phe Leu Leu Ser Leu Phe Ser Leu Trp Leu Ser 
65 70 75 

Ser Lys Pro Pro Ser Lys Arg Leu Thr Phe Gly Trp His Arg Ala 
80 85 go 

Glu He Leu Gly Ala Leu Leu Ser He Leu Cys He Trp Val Val 
95 100 105 

Thr Gly Val Leu Val Tyr Leu Ala Cys Glu Arg Leu Leu Tyr Pro 
110 H5 120 

Asp Tyr Gin He Gin Ala Thr Val Met He He Val Ser Ser Cys 
125 130 135 

Ala Val Ala Ala Asn He Val Leu Thr Val Val Leu His Gin Arg 
140 145 150 

Cys Leu Gly His Asn His Lys Glu Val Gin Ala Asn Ala Ser Val 
155 160 165 

Arg Ala Ala Phe Val His Ala Leu Gly Asp Leu Phe Gin Ser He 
170 175 180 

Ser Val Leu He Ser Ala Leu He He Tyr Phe Lys Pro Glu Tyr 
I 85 190 195 

Lys He Ala Asp Pro He Cys Thr Phe He Phe Ser He Leu Val 
200 205 210 

Leu Ala Ser Thr lie Thr He Leu Lys Asp Phe Ser lie Leu Leu 
215 220 225 

Met Glu Gly Val Pro Lys Ser Leu Asn Tyr Ser Gly Val Lys Glu 
230 235 240 
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Leu lie Leu Ala Val Asp Gly Val Leu Ser Val His Ser Leu His 

250 255 
He Trp Ser Leu Thr Met Asn Gin Val lie Leu Ser Ala His Val 

2"0 265 ?7n 

Ala Thr Ala Ala Ser Arg Asp Ser Gin Val Val Arg Arg Glu lie 
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*i 275 280 285 

Ala Lys Ala Leu Ser Lys Ser Phe Thr Met His Ser Leu Thr He 
290 295 ion 

Gin Met Glu Ser Pro Val Asp Gin Asp Pro Asp Cys Leu Phe Cys 
305 310 iic 

Glu Asp Pro Cys Asp J " 

320 

<210> 15 
<211> 462 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2329339CD1 

<400> 15 



Met Ala Glu Glu Gin Glu Phe Thr Gin Leu Cys Lys Leu Pro Ala 

Gin Pro Ser His Pro His Cys Val Asn Asi Thr Tyr Arg Ser Ala 
20 25 -an 

Gin His Ser Gin Ala Leu Leu Arg Gly Leu Leu Ala Leu Arg Asp 
35 40 4c 

Ser Gly lie Leu Phe Asp Val Val Leu Val Val Glu Gly Arg His 

He Glu Ala His Arg He Leu Leu Ala Ala Ser Cys Asp Tyr Phe 
65 70 7 c 

Arg Gly Met Phe Ala Gly Gly Leu Lys Glu Met Glu Gin Glu Glu 
80 Q5 nrs 

Val Leu He His Gly Val Ser Tyr Asn Ala Met Cys Gin He Leu 

His Phe lie Tyr Thr Ser Glu Leu Glu llu Ser Leu Ser Asn Val 

110 115 
Gin Glu Thr Leu Val Ala Ala Cys Gin Leu Gin He Pro Glu lie 

130 135 
He His Phe Cys Cys Asp Phe Leu Met Ser Trp Val Asp Glu Glu 

140 145 ic n 

Asn He Leu Asp Val Tyr Arg Leu Ala Glu Leu Phe Asp Leu Ser 

155 160 1 *;q 

Arg Leu Thr Glu Gin Leu Asp Thr Tyr lie Leu Lys Asn Phe val 

170 175 1Rn 

Ala Phe Ser Arg Thr Asp Lys Tyr Arg Gin Leu Pro Leu Glu Lys 

185 190 iqc 

Val Tyr Ser Leu Leu Ser Ser Asn Arg Leu Glu Val Ser Cys Glu 

200 205 9in 

Thr Glu Val Tyr Glu Gly Ala Leu Leu Tyr His Tyr Ser Leu Glu 

Gin Val Gin Ala Asp Gin He Ser Leu His Glu Pro Pro Lys III 
230 235 OACi 

Leu Glu Thr Val Arg Phe Pro Leu Met Glu Ala Glu Val Leu 111 
" 250 255 

Arg Leu His Asp Lys Leu Asp Pro Ser Pro Leu Arg Asp Thr Val 
260 265 07n 

Ala Ser Gly Leu Met Tyr His Arg Asn Glu Ser Leu Gin Pro Ser 

275 280 9fle: 

Leu Gin Ser Pro Gin Thr Glu Leu Arg Ser Asp Phe Gin Cys Val 
290 295 7nn 

Val Gly Phe Gly Gly He His Ser Thr Pro Ser Thr Val Leu Ser 
05 310 315 

Asp Gin Ala Lys Tyr Leu Asn Pro Leu Leu Gly Glu Trp Lys His 

Phe Thr Ala Ser Leu Ala Pro Arg Met III Asn Gin Gly He 111 
33 5 340 345 
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Val 


Leu 


Asn Asn 


Phe 


Val Tyr 








350 


Gin 


Gly 


Phe Arg 


Ala 


Glu Ser 








365 




His 


Asn 


Arg Trp 


Phe 


Gin He 








380 




Asp 


Leu 


Ser Val 


Cys 


Val Val 








395 




Gly 


Arg 


Asp Tyr 


His 


Asn Asp 








410 


Pro 


Ala 


Thr Asn 


Ser 


Trp Ala 








425 


Val 


Tyr 


Ala His 


Ala 


Gly Ala 








440 


Thr 


Cys 


Gly Arg 


Lys 


Leu He 








455 





355 360 
3 Cys Trp Arg Tyr Asp Pro Arg 
370 375 
a Ser Leu Gin Gin Glu His Ala 
385 390 
/ Arg Tyr He Tyr Ala Val Ala 
400 405 
i Asn Ala Val Glu Arg Tyr Asp 
415 420 
r Val Ala Pro Leu Lys Arg Glu 
430 435 
r Leu Glu Gly Lys Met Tyr He 
445 450 
5 Phe Ser Glu Gly 
460 

<210> 16 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2540219CD1 

<400> 16 

Met Arg Ala Cys Ala Val Trp Leu Ala Gly Gly Met Ala Gly Ala 

, 5 10 15 

He Ser Trp Gly Thr Ala Thr Pro Met Asp Val Val Lys Ser Arg 

20 25 30 

Leu Gin Ala Asp Gly Val Tyr Leu Asn Lys Tyr Lys Gly Val Leu 

35 40 45 

Asp Cys He Ser Gin Ser Tyr Gin Lys Glu Gly Leu Lys Val Phe 

50 55 60 

Phe Arg Gly He Thr Val Asn Ala Val Arg Gly Phe Pro Met Ser 

65 70 75 

Ala Ala Met Phe Leu Gly Tyr Glu Leu Ser Leu Gin Ala He Arg 

80 85 90 

Gly Asp His Ala Val Thr Ser Pro 
95 

<210> 17 
<211> 748 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2722462CD1 



10 15 
Asp Ala Lys Ala Leu Ala 

25 30 
Phe Tyr Leu Met Glu Leu 

40 45 
Ser Leu Cys Glu Ala Pro 

55 60 
Tyr Val His Ala Thr Leu 

70 75 
Phe Glu Leu Cys Met Lys 

85 90 
lie Arg His Lys Arg Thr 
100 105 
Gin Phe Val Glu Ala lie 
US 120 
Val Arg Val Thr Arg Ala 
130 135 



<400> 17 








Met 


Asn 


Tyr 


Gin 


Glu Ala 


Ala 


1 








5 


Asn 


Asp 


Lys 


Phe 


Phe Thr 


His 


Ala 








20 




Tyr 


Leu 


Phe 


Ala His 
35 


Asn 


Ala 


Thr 


Ala 


Leu 


Leu Leu 
50 


Leu 


Ala 


Val 


Pro 


Ala 


Leu Arg 
65 


Leu 


Glu 


Leu 


Phe 


Ala 


Leu Met 
80 


Val 


Leu 


Arg 


Trp 


Leu Gly Leu 


His 










95 




Met 


Val 


Lys 


Thr 


Ser Val 
110 


Leu 


Val 


Val 


Leu 


Val 


Arg Gin 
125 


Met 
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Leu Arg Cys He Phe Leu Val Asp Cys Arg Tyr Cys Gly Gly Val 
40 145 150 

Arg Arg Asn Leu Arg Gin He Phe Gin Ser Leu Pro Pro Phe Met 
155 160 ICS 

Asp He Leu Leu Leu Leu Leu Phe Phe Met He He Phe Ala He 
170 175 . Rn 

Leu Gly Phe Tyr Leu Phe Ser Pro Asn Pro Ser Asp Pro Tyr Phe 
185 190 ins 

Ser Thr Leu Glu Asn Ser He Val Ser Leu Phe Val Leu Leu Thr 

205 210 
Thr Ala Asn Phe Pro Asp Val Met Met Pro Ser Tyr Ser Arg Asn 
215 220 235 

Pro Trp Ser Cys Val Phe Phe He Val Tyr Leu Ser lie Glu Leu 

235 240 
Tyr Phe He Met Asn Leu Leu Leu Ala Val Val Phe Asp Thr Phe 
245 250 555 

Asn Asp He Glu Lys Arg Lys Phe Lys Ser Leu Leu Leu His Lys 
260 265 onr\ 

Arg Thr Ala He Gin His Ala Tyr Arg Leu Leu He Ser Gin Arg 
275 280 one 

Arg Pro Ala Gly lie Ser Tyr Arg Gin Phe Glu Gly Leu Met Arg 

295 inn 
Phe Tyr Lys Pro Arg Met Ser Ala Arg Glu Arg Tyr Leu Thr Phe 
r 305 310 315 

Lys Ala Leu Asn Gin Asn Asn Thr Pro Leu Leu Ser Leu Lys Asd 
3 20 325 ^« 

Phe Tyr Asp He Tyr Glu Val Ala Ala Leu Lys Trp Lys Ala Lys 

340 

Lys Asn Arg Glu His Trp Phe Asp Glu Leu Pro Arg Thr Ala Leu 

350 355 
Leu He Phe Lys Gly He Asn He Leu Val Lys Ser Lys Ala Phe 

365 370 -17* 

Gin Tyr Phe Met Tyr Leu Val Val Ala Val Asn Gly Val Trp III 

Leu val Glu Thr Phe Met Leu Lys Gly Gly Asn Phe Phe Ser HI 
395 4Q0 ii- 

His Val Pro Trp Ser Tyr Leu Val Phe Leu Thr He Tyr Gly Val 

415 420 
Glu Leu Phe Leu Lys Val Ala Gly Leu Gly Pro Val Glu Tyr Leu 
^ 42 5 430 435 

Ser Ser Gly Trp Asn Leu Phe Asp Phe Ser Val Thr Val Phe Ala 
440 445 450 

Phe Leu Gly Leu Leu Ala Leu Ala Leu Asn Met Glu Pro Phe Tyr 

Phe He Val Val Leu Arg Pro Leu Gin Leu Leu Arg Leu Phe i" 

475 480 

Leu Lys Glu Arg Tyr Arg Asn Val Leu Asp Thr Met Phe Glu Leu 

485 49Q 

Leu Pro Arg Met Ala Ser Leu Gly Leu Thr Leu Leu He Phe Tyr 

500 505 ,-^ n 

Tyr Ser Phe Ala He Val Gly Met Glu Phe Phe Cys Gly He Val 

515 520 505 

Phe Pro Asn Cys Cys Asn Thr Ser Thr Val Ala Asp Ala Tyr Arg 

Trp Arg Asn His Thr Val Gly Asn Arg Thr Val Val Glu Glu lly 
545 550 55s 

Tyr Tyr Tyr Leu Asn Asn Phe Asp Asn He Leu Asn Ser Phe Val 

555 R7n 
Thr Leu Phe Glu Leu Thr Val Val Asn Asn Trp Tyr He He Met 

Glu Gly Val Thr Ser Gin Thr Ser His I™ Ser Arg Leu Tyr Ine 
590 595 finn 

Met Thr Phe Tyr lie Val Thr Met Val Val Met Thr He He Val 
"05 5i0 c-tc 

Ala Phe He Leu Glu Ala Phe Val Phe Arg Met Asn Tyr Ser Arg 

625 fi**n 

Lys Asn Gin Asp Ser Glu Val Asp Gly Gly He Thr Leu Glu Lys 
635 640 645 
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650 

Glu Ala Arg Gly Ala Ser i 
665 

Leu Ser Gin Met Glu Arg 
680 

Gly Arg Arg Ser Arg Thr I 
695 

Gin Glu Glu He Gin Glu 1 
710 

Glu Gin Gin Arg Gin Leu £ 
725 

Gin Pro Pro Gly Ser Arg c 
740 

<210> 18 
<211> 507 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2739264CD1 



Val 


Ala 


Val 
655 


Leu 


Glu 


Leu 


Tyr Arg 
660 


Asp 


Val 


Thr 


Arg 


Leu 


Leu 


Glu Thr 


Gin 




670 








675 


Gin 


His 
685 


Ser 


Met 


Val 


Phe Leu 
690 


Ser 


Asp Leu 


Ser 


Leu 


Lys 


Met Tyr 






700 








705 


Tyr 


Glu 


Glu 


His 


Ala Arg Glu Gin 






715 








720 


Ser 


Ser 


Ala 
730 


Ala 


Pro 


Ala 


Ala Gin 
735 


Arg 


Ser 


Gin 
745 


Thr 


Val 


Thr 





<400> 18 

Met Ala Phe Asn Phe Gly Ala Pro Ser Gly Thr Ser Gly Thr Ala 

Ala Ala Thr Ala Ala Pro Ala Gly Gly P Je Gly Gly Phe Gly Thr 

20 25 -, n 

Thr Ser Thr Thr Ala Gly Ser Ala Phe Ser Phe Ser Ala Pro Thr 

35 40 ac 

Asn Thr Gly Thr Thr Gly Leu Phe Gly Gly Thr Gin Asn Lys Gly 

55 fin 

Phe Gly Phe Gly Thr Gly Phe Gly Thr Thr Thr Gly Thr Ser Thr 

65 70 7 e 

Gly Leu Gly Thr Gly Leu Gly Thr Gly Leu Gly Phe Gly Gly Phe 

Asn Thr Gin Gin Gin Gin Gin Thr Thr Leu Gly Gly Leu Phe sir 
? 5 100 105 

la Pro Thr Gin Ser Asn Gin Leu He Asn 
110 115 120 

Ser Ala Pro Thr Leu Leu Gly Asp Glu Arg 
125 130 135 

Lys Trp Asn Gin Leu Gin Ala Phe Trp Gly 
140 145 iso 

Phe Asn Asn Asn He Pro Pro Val Glu Phe 
155 160 165 

Phe Cys Arg Phe Lys Ala Val Gly Tyr Ser 
T 170 175 iso 

Lys Asp Glu Asp Gly Leu Val Val Leu Val 
185 190 195 

Thr Glu lie Arg Ser Gin Gin Gin Gin Leu 
200 205 210 

Lys Val Leu Gly Gly Asn Gin Thr Leu Thr 
215 220 225 

- — c- Asp Gin Thr Glu 

230 235 24Q 

Glu Arg Ser Pro Asn Gly Thr Ser Arg 

245 250 255 



Gin Pro Thr Gin Ala Pro Thr Gin Ser Asn Gin Leu He Asn Thr 

115 ion 
Ala Ser Ala Leu Ser Ala Pro Thr Leu Leu Gly Asp Glu Arg Asp 

125 130 13c 

Ala He Leu Ala Lys Trp Asn Gin Leu Gin Ala Phe Trp Gly Thr 

140 145 150 

Gly Lys Gly Tyr Phe Asn Asn Asn He Pro Pro Val Glu Phe Thr 

Gin Glu Asn Pro Phe Cys Arg Phe Lys Ala Val Gly Tyr Ser C^s 
170 175 igA 

Met Pro Ser Asn Lys Asp Glu Asp Gly Leu Val Val Leu Val Phe 

185 190 1QE 

Asn Lys Lys Glu Thr Glu He Arg Ser Gin Gin Gin Gin Leu Val 
00 205 210 

Glu Ser Leu His Lys Val Leu Gly Gly Asn Gin Thr Leu Thr Val 

Asn Val Glu Gly Thr Lys Thr Leu Pro III Asp Gin Thr Glu Val 

230 235 oAf\ 

Val He Tyr Val Val Glu Arg Ser Pro Asn Gly Thr Ser Arg Arg 

245 250 
Val Pro Ala Thr Thr Leu Tyr Ala His Phe Glu Gin Ala Asn He 

265 270 
Lys Thr Gin Leu Gin Gin Leu Gly Val Thr Leu Ser Met Thr Arg 

275 280 ooc 

Thr Glu Leu Ser Pro Ala Gin He Lys Gin Leu Leu Gin Asn Pro 

2^0 295 mn 

Pro Ala Gly Val Asp Pro He He Trp Glu Gin Ala Lys Val Asp 

305 310 31c 

Asn Pro Asp Ser Glu Lys Leu He Pro Val Pro Met Val Gly Phe 

320 325 330 
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Lys Glu Leu Leu Arg Arg Leu Lys Val Gin Asp Gin Met Thr Lys 

340 iIr 
Gin His Gin Thr Arg Leu Asp lie lie Ser Glu Asp lie Ser Glu 

350 355 ^ fin 

Leu Gin Lys Asn Gin Thr Thr Ser Val Ala Lys He Ala Gin Tyr 

365 370 375 

Lys Arg Lys Leu Met Asp Leu Ser His Arg Thr Leu Gin Val Leu 

He Lys Gin Glu lie Gin Arg Lys Ser lly Tyr Ala lie Gin HI 

395 400 dfiR 

Asp Glu Glu Gin Leu Arg Val Gin Leu Asp Thr lie Gin Gly Glu 

410 4 _ n 
Leu Asn Ala Pro Thr Gin Phe Lys Gly Arg Leu Asn Glu Leu Met 

425 43Q 
Ser Gin lie Arg Met Gin Asn His Phe Gly Ala Val Arg Ser Glu 

445 a*(\ 
Glu Arg Tyr Tyr He Asp Ala Asp Leu Leu Arg Glu He Lys Gin 

455 4go 
His Leu Lys Gin Gin Gin Glu Gly Leu Ser His Leu He Ser II 

0 475 Aon 

He Lys Asp Asp Leu Glu Asp He Lys Leu Val Glu His Gly Leu 

485 49Q - Qq 

Asn Glu Thr lie His He Arg Gly Gly Val Phe Ser 

500 505 
<210> 19 DUi 
<211> 592 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2758310CD1 

<400> 19 

Met Trp Phe Cys Gly Gin Ser Thr Pro Phe Gly Cys Glu Leu His 
Asp Thr Cys Val Gin Leu Cys His Phe His Ser Ala Leu Leu Hit 

25 -in 

Arg Arg Gin Lys Pro Trp Pro Ser Pro Ala Val Phe Phe Arg Arg 
35 40 ac 

Asn Val Arg Gly Leu Pro Pro Arg Phe Ser Ser Pro Thr Pro Leu 
50 55 fin 

Trp Arg Lys Val Leu Ser Thr Ala Val Val Gly Ala Pro Leu Leu 

" 70 7 r 

Leu Gly Ala Arg Tyr Val Met Ala Glu Ala Arg Glu Lys Arg Arg 

80 Q5 
Met Arg Leu Val Val Asp Gly Met Gly Arg Phe Gly Arg Ser Leu 
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Lys Val Gly Leu Gin lie Ser Leu Asp Tyr Trp Trp Cys Thr Asn 
Gly 

125 130 ~ V-ac 

Ser Ala Cys His Gin Arg Ala Ala Asp Ala Leu Val Ala Gly Ala 
140 - * - 



xxu 115 i?n 

Val Val Leu Arg Gly Trp Lys Ser Pro Gly Tyr Leu Glu Val Met 



_ 100 105 

Asn 
120 
Met 
135 
Ala 

He Ser Asn Gly Gly Leu Tyr Val Lys Jeu Gly Gin Gly Leu Cys 

160 165 
Ser Phe Asn His Leu Leu Pro Pro Glu Tyr Thr Arg Thr Leu Arg 
170 175 «. q~ 

Val Leu Glu Asp Arg Ala Leu Lys Arg Gly Phe Gin Glu Val Asp 

" 190 1 

Glu Leu Phe Leu Glu Asp Phe Gin Ala Leu Pro His Glu Leu Phe 

205 *>in 
Gin Glu Phe Asp Tyr Gin Pro He Ala Ala Ala Ser Leu Ala Gin 
^15 220 oo^ 

Val His Arg Ala Lys Leu His Asp Gly Thr Ser Val Ala Val Lys 

235 oln 
Val Gin Tyr He Asp Leu Arg Asp Arg Phe Asp Gly Asp He His 
245 250 255 
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Ala Gly Cys Lys Val Asn Asp Val Glu Ala lie Arg Ser Gin Gly 
"5 340 34 c 

Leu Ala Val His Asp lie Ala Glu Lys Leu lie Lys Ala Phe Ala 
350 355 ifin 

Glu Gin lie Phe Tyr Thr Gly Phe lie His Ser Asp Pro His Pro 
365 370 i 7 r 

Gly Asn Val Leu Val Arg Lys Gly Pro Asp Gly Lys Ala Glu ill 

Val Leu Leu Asp His Gly Leu Tyr Gin Phe Leu Glu Glu Lys AsS 
" 400 Af\% 

Arg Ala Ala Leu Cys Gin Leu Trp Arg Ala lie lie Leu Arg Asp 

415 A?f) 
Asp Ala Ala Met Arg Ala His Ala Ala Ala Leu Gly Val Gin Asp 

425 430 
Tyr Leu Leu Phe Ala Glu Met Leu Met Gin Arg Pro Val Arg Leu 

440 445 
Gly Gin Leu Trp Gly Ser His Leu Leu Ser Arg Glu Glu Ala Ala 

455 4go 
Tyr Met Val Asp Met Ala Arg Glu Arg Phe Glu Ala Val Met Ala 

Val Leu Arg Glu Leu Pro Arg Pro Met tit Leu Val Leu Arg Isn 

485 490 4&e 

lie Asn Thr Val Arg Ala He Asn Val Ala Leu Gly Ala Pro Val 

500 505 
Asp Arg Tyr Phe Leu Met Ala Lys Arg Ala Val Arg Gly Trp Ser 

515 520 coc 

Arg Leu Ala Gly Ala Thr Tyr Arg Gly Val Tyr Gly Thr Ser Leu 

530 535 r An 

Leu Arg His Ala Lys Val Val Trp Glu Met Leu Lys Phe Glu Val 

545 550 ccc 

Ala Leu Arg Leu Glu Thr Leu Ala Met Arg Leu Thr Ala Leu Leu 
^ 565 57n 

Ala Arg Ala Leu Val His Leu Ser Leu Val Pro Pro Ala Glu Glu 

Leu Tyr Gin Tyr Leu Glu Thr 58 ° 585 

590 

<210> 20 

<211> 841 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2762348CD1 

<400> 20 

Met Ala Ser Val Phe Arg Ser C 

1 5 
Phe Leu Gin Val Glu Ala Ala •] 
20 

Glu Leu Gly Leu Val Gin Phe I 
35 

Ser Phe Gin Arg Lys Phe Val P 
50 

Leu Glu Arg He Leu Arg Phe I 
65 

He Val Val Gin Leu Leu Glu L 
80 



Glu 


Met 


Cys 


Leu 


Ser 


Gin 


Leu 




10 










15 


Cys 


Cys 


Val 


Ala 


Glu 


Leu Gly 




25 










30 


Asp 


Leu 


Asn 


Met 


Asn 


Val 


Asn 


Glu 


40 










45 


Val 


Arg 


Arg 


Cys 


Glu 


Ser 




55 










60 


Glu 


Asp 


Glu 


Met 


Gin 


Asn 


Glu 




70 










75 


Ser 


Pro 


Leu 


Thr 


Pro 


Leu 


Pro 




85 










90 
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Arg Glu Met He Thr Leu Glu Thr Val Leu Glu Lys Leu Glu Gly 

95 100 105 
Glu Leu Gin Glu Ala Asn Gin Asn Gin Gin Ala Leu Lys Gin Ser 

110 115 120 

Phe Leu Glu Leu Thr Glu Leu Lys Tyr Leu Leu Lys Lys Thr Gin 

125 130 135 

Asp Phe Phe Glu Thr Glu Thr Asn Leu Ala Asp Asp Phe Phe Thr 

140 145 150 

Glu Asp Thr Ser Gly Leu Leu Glu Leu Lys Ala Val Pro Ala Tyr 

155 160 165 

Met Thr Gly Lys Leu Gly Phe He Ala Gly Cys Asp Pro Thr Gly 

170 175 180 

Lys Arg Met Ala Ser Phe Glu Arg Leu Leu Trp Arg Val Cys Arg 

185 190 195 

Gly Asn Val Tyr Leu Lys Phe Ser Glu Met Asp Ala Pro Leu Glu 

200 205 210 

Asp Pro Val Thr Lys Glu Glu He Gin Lys His He Phe He He 

215 220 225 

Phe Tyr Gin Gly Glu Gin Leu Arg Gin Lys He Lys Lys He Cys 

230 235 240 

Asp Gly Phe Arg Ala Thr Val Tyr Pro Cys Pro Glu Pro Ala Val 

245 250 255 

Glu Arg Arg Glu Met Leu Glu Ser Val Asn Val Arg Leu Glu Asp 

260 265 270 

Leu He Thr Val He Thr Gin Thr Glu Ser His Arg Gin Arg Leu 

275 280 285 

Leu Gin Glu Ala Ala Ala Asn Trp His Ser Trp Leu He Lys Val 

r » 290 295 300 

Gin Lys Met Lys Ala Val Tyr His He Leu Asn Met Cys Asn He 
305 310 315 

Asp Val Thr Gin Gin Cys Val He Ala Glu He Trp Phe Pro Val 
320 325 330 

Ala Asp Ala Thr Arg He Lys Arg Ala Leu Glu Gin Gly Met Glu 
335 340 345 

Leu Ser Gly Ser Ser Met Ala Pro He Met Thr Thr Val Gin Ser 
mi. 350 355 360 

Lys Thr Ala Pro Pro Thr Phe Asn Arg Thr Asn Lys Phe Thr Ala 

365 370 375 

Gly Phe Gin Asn He Val Asp Ala Tyr Gly Val Gly Ser Tyr Arg 

380 385 390 

Glu He Asn Pro Ala Pro Tyr Thr He He Thr Phe Pro Phe Leu 

39 5 400 405 

Phe Ala Val Met Phe Gly Asp Cys Gly His Gly Thr Val Met Leu 

410 415 420 

Leu Ala Ala Leu Trp Met He Leu Asn Glu Arg Arg Leu Leu Ser 

425 4 30 435 

Gin Lys Thr Asp Asn Glu He Trp Asn Thr Phe Phe His Gly Arg 
^ 44 ° 445 450 

Tyr Leu He Leu Leu Met Gly He Phe Ser lie Tyr Thr Gly Leu 
T1 ^ ^ 455 460 465 

He Tyr Asn Asp Cys Phe Ser Lys Ser Leu Asn He Phe Gly Ser 

470 475 4QQ 

Ser Trp Ser Val Gin Pro Met Phe Arg Asn Gly Thr Trp Asn Thr 

485 490 495 

His Val Met Glu Glu Ser Leu Tyr Leu Gin Leu Asp Pro Ala He 

500 505 510 

Pro Gly Val Tyr Phe Gly Asn Pro Tyr Pro Phe Gly He Asp Pro 

515 520 525 

He Trp Asn Leu Ala Ser Asn Lys Leu Thr Phe Leu Asn Ser Tyr 

530 535 540 

Lys Met Lys Met Ser Val He Leu Gly lie Val Gin Met Val Phe 

545 550 555 

Gly Val lie Leu Ser Leu Phe Asn His lie Tyr Phe Arg Arg Thr 

560 565 570 

Leu Asn lie lie Leu Gin Phe lie Pro Glu Met lie Phe lie Leu 

575 580 585 

Cys Leu Phe Gly Tyr Leu Val Phe Met lie lie Phe Lys Trp Cys 

59 ° 595 600 
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Cys Phe Asp Val His Val Ser Gin His Ala Pro Ser He Leu He 
605 610 615 

His Phe He Asn Met Phe Leu Phe Asn Tyr Ser Asp Ser Ser Asn 

*i ~ . 620 625 630 

Ala Pro Leu Tyr Lys His Gin Gin Glu Val Gin Ser Phe Phe Val 

635 640 645 

Val Met Ala Leu lie Ser Val Pro Trp Met Leu Leu He Lys Pro 
650 655 660 

Phe lie Leu Arg Ala Ser His Arg Lys Ser Gin Leu Gin Ala Ser 

665 670 675 

Arg lie Gin Glu Asp Ala Thr Glu Asn He Glu Gly Asp Ser Ser 

680 685 690 

Ser Pro Ser Ser Arg Ser Gly Gin Arg Thr Ser Ala Asp Thr His 

695 700 705 

Gly Ala Leu Asp Asp His Gly Glu Glu Phe Asn Phe Gly Asp Val 

710 715 720 

Phe Val His Gin Ala He His Thr lie Glu Tyr Cys Leu Gly Cys 

725 730 735 

lie Ser Asn Thr Ala Ser Tyr Leu Arg Leu Trp Ala Leu Ser Leu 

740 745 750 

Ala His Ala Gin Leu Ser Glu Val Leu Trp Thr Met Val Met Asn 

755 760 765 

Ser Gly Leu Gin Thr Arg Gly Trp Gly Gly lie Val Gly Val Phe 

770 775 78Q 

He lie Phe Ala Val Phe Ala Val Leu Thr Val Ala lie Leu Leu 

785 790 795 

He Met Glu Gly Leu Ser Ala Phe Leu His Ala Leu Arg Leu His 

800 805 810 

Trp Val Glu Phe Gin Asn Lys Phe Tyr Val Gly Asp Gly Tyr Lys 

815 820 825 

Phe Ser Pro Phe Ser Phe Lys His lie Leu Asp Gly Thr Ala Glu 

Glu 830 835 

<210> 21 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3715961CD1 

<400> 21 

Met Ser Glu Cys Pro Leu lie Leu Tyr lie His Lys His He Asp 

10 15 
Thr Tyr Ser Gin Ser Tyr Leu Phe Asn Asp Leu Phe Tyr Pro Val 
m. 20 25 30 

Tyr Ser Gly Gly Arg Met Val Thr Tyr Glu His Leu Arg Glu Val 

35 40 45 

Val Phe Gly Lys Ser Glu Asp Glu His Tyr Pro Leu Trp Lys Ser 

50 55 g 0 

Val lie Gly Gly Met Met Ala Gly Val lie Gly Gin Phe Leu Ala 

65 70 75 

Asn Pro Thr Asp Leu Val Lys Val Gin Met Gin Met Glu Gly Lys 

80 85 90 

Arg Lys Leu Glu Gly Lys Pro Leu Arg Phe Arg Gly Val His His 

95 100 105 

Ala Phe Ala Lys lie Leu Ala Glu Gly Gly He Arg Gly Leu Trp 

110 us 120 

Ala Gly Trp Val Pro Asn He Gin Arg Ala Ala Leu Val Asn Met 

125 130 135 

Gly Asp Leu Thr Thr Tyr Asp Thr Val Lys His Tyr Leu Val Leu 

I 40 145 150 

Asn Thr Pro Leu Glu Asp Asn lie Met Thr His Gly Leu Ser Ser 

I 55 160 165 

Leu cys ser Gly Leu Val Ala Ser He Leu Gly Thr Pro Ala Asp 

170 175 180 
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val He Lys Ser Arg il e Met Asn Gin Pro Arg Asp Lys Gin Gly 

Lys Ser Ser Thr 
Phe Met Ser Leu 

Thr Pro Trp Ser ^ _ 

Tyr Glu Lys lie teg G lu Met Ser Gly lit Ser Pro Phe ^° 



190 195 

Ala 
210 
Pro 

_ _ ^ J -- J 220 one 

Ser Trp Leu Arg Met Thr Pro Trp Ser Met Val Phe Trp Leu Thr 



Arg Gly Leu Leu Tyr Lys Ser Ser Thr Asp Cys Leu lie Gin HI 
Gly 

21 5 220 " 225 



-J. — — — <-» UJ.il X4JLCI 

Val Gin Gly Glu Gly Phe Met Ser Leu ™ Lys Gly Phe Leu Pro" 



245 250 
<210> 22 3U 

<211> 229 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5108194CD1 

<400> 22 

Met Gly Asn Gly Val Lys Glu Gly Pro Val Arg Leu His Glu Asp 

Ala Glu Ala Val Leu Ser Ser Ser Val Ser Ser Lys Arg Asp His 

Arg Gin Val Leu Ser Ser Leu Leu Ser G^y Ala Leu Ala Gly a?! 

40 jir 
Leu Ala Lys Thr Ala Val Ala Pro Leu Asp Arg Thr Lys lie lie 
DO 55 cr\ 

Phe Gin val Ser Ser Lys Arg Phe Ser Ala Lys Glu Ala Phe Ar^ 

70 TC 

Val Leu Tyr Tyr Thr Tyr Leu Asn Glu Gly Phe Leu Ser Leu Trp" 

85 on 
Arg Gly Asn Ser Ala Thr Met Val Arg Val Val Pro Tyr Ala Al 

100 1 nt 

He Gin Phe Ser Ala His Glu Glu Tyr Lys Arg lie Leu Gly Ser 

111) lie -lift 

Tyr Tyr Gly Phe Arg Gly Glu Ala Leu Pro Pro Trp Pro Arg llu 
Phe Ala Gly Ala Leu Ala Gly Thr Thr Ala Ala Ser Leu Thr Tyr 



155 160 J " lis 
Ile Arg lie Ser Arg Glu 

175 180 

Phe Met Pro Thr Val Leu 

190 195 

Phe Phe Thr Tyr Glu Thr 

^ uu 205 ^1 n 

Leu Lys Ser Leu His Arg Glu Tyr Ser Gly Arg Lys Leu lie llo 



Met Tyr Ser Asn lie Phe His Val Phe lie Arg He Ser Arg g" 

Thr 

185 190 Tot; 

Gly Val lie Pro Tyr Ala Gly Leu Ser Phe Phe Thr Tyr Glu Thr 



Glu Gly Leu Lys Thr Leu Tyr His Gly Phe Met Pro Thr Val III 



Pro Leu Asp Leu Val Arg Ala Arg Met Ala Val Thr Pro Lys gII 

Ile 
170 
Thr 
185 
Tyr 
200 
His 

Phe Ser Glu Gly 220 225 

<210> 23 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CD1 

<400> 23 

Met Tyr Asp Asn Leu Tyr Leu His Gly lie Glu Asp Ser Glu Ala 
Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser Asp v" 
Ser Leu Glu Glu Asp Arg Glu Ala lie Ar| Gin Glu Arg Glu Gin 
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40 45 
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Gin Ala Ala He Gin Leu Glu Arg Ala Lys Ser Lys Pro Val Ala 

Phe Ala Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp 61 J 
65 70 7C 

Asp val Pro Val Pro Ser Thr Ala lie Ser Phe Asp Ala Lys Asp 
80 g5 q« 

Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp Trp He Gly 
95 100 ml 

Arg Leu Val Lys Glu Gly Cys Glu lie Gly Phe lie Pro Ser Pro 
110 115 ion 

Leu Arg Leu Glu Asn lie Arg He Gin Gin Glu Gin Lys Arg Gly 
125 130 17^ 

Arg Phe His Gly Gly Lys Ser Ser Gly Asn Ser Ser Ser Ser Leu 
140 

Gly Glu Met Val Ser Gly Thr Phe Arg Ala Thr Pro Thr Ser Thr 

155 ifio 1 cc 

Gly Glu Gly Cys Ser 165 
170 

<210> 24 
<211> 655 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5517972CD1 

<400> 24 

Met Ser Ser Ser Asn Val Glu Val Phe lie Pro Val Ser Gin Gly 
Asn Thr Asn Gly Phe Pro Ala Thr Ala Ser" Asn Asp Leu Lys a\1 

20 25 -an 

Phe Thr Glu Gly Ala Val Leu Ser Phe His Asn He Cys Tyr Arg 
35 40 4 c 

Val Lys Leu Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu 
50 55 fin 

Lys Glu He Leu Ser Asn He Asn Gly He Met Lys Pro Gly Leu 
°5 70 7 c 

Asn Ala He Leu Gly Pro Thr Gly Gly Gly Lys Ser Ser Leu Leu 
80 Q5 nf\ 

Asp Val Leu Ala Ala Arg Lys Asp Pro Ser Gly Leu Ser Gly Asp 

100 105 
.. Ala 

A 1 ? . 115 120 

Thr 
135 

_ Thr 

145 150 
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Ser Gly Tyr Val Val Gin Asp Asp Val Val Met Gly Thr Leu Thr 

125 ^30 •] -jn 

Val Arg Glu Asn Leu Gin Phe Ser Ala Ala Leu Arg Leu Ala Thr 

140 145 - c n 

Thr Met Thr Asn His Glu Lys Asn Glu Arg He Asn Arg Val ill 



Val Leu He Asn Gly Ala Pro Arg Pro Ala Asn Phe Lys Cys Asn 

110 H5 120 

Val Met Gly Thr Leu Thr 

130 135 

Ala Leu Arg Leu Ala Thr 

145 150 

Arg He Asn Arg Val He 

Gin Glu Leu Gly Leu Asp Lys Val Ala Hp Ser Lys Val Gly Thr 
^ 175 i on 

Gin Phe He Arg Gly val Ser Gly Gly Glu Arg Lys Arg Thr Ser 
185 190 i qc 

He Gly Met Glu Leu He Thr Asp Pro Ser He Leu Phe Leu Asp 
200 205 oin 

Glu Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val Leu 
215 220 oor 

Leu Leu Leu Lys Arg Met Ser Lys Gin Gly Arg Thr He He Phe 

235 oan 
Ser He His Gin Pro Arg Tyr Ser He Phe Lys Leu Phe Asp Ser 
245 250 oc.ci 

Leu Thr Leu Leu Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala 
260 265 07n 

Gin Glu Ala Leu Gly Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu 
»n ™_ 275 280 pas 

Ala Tyr Asn Asn Pro Ala Asp Phe Phe Leu Asp He He Asn Gly 
290 295 300 
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Asp Ser Thr Ala Val Ala Leu Asn Arg Glu Glu Asp Phe Lys Ala 
305 320 iic 

Thr Glu lie lie Glu Pro Ser Lys Gin Asp Lys Pro Leu lie Glu 
" 325 330 

Lys Leu Ala Glu lie Tyr Val Asn Ser Ser Phe Tyr Lys Glu Thr 

335 34Q 
Lys Ala Glu Leu His Gin Leu Ser Gly Gly Glu Lys Lys Lys Lys 

350 355 * 

lie Thr val Phe Lys Glu lie Ser Tyr Thr Thr Ser Phe Cys His 

370 375 
Gin Leu Arg Trp Val Ser Lys Arg Ser Phe Lys Asn Leu Leu Gly 
380 385 Ton 

Asn Pro Gin Ala Ser lie Ala Gin lie He Val Thr Val Val Leu 
395 400 d Ac 

Gly Leu Val lie Gly Ala lie Tyr Phe Gly Leu Lys Asn Asp ill 
41u 415 420 

Thr Gly He Gin Asn Arg Ala Gly Val Leu Phe Phe Leu Thr Thr 

430 435 
Asn Gin Cys Phe Ser Ser Val Ser Ala Val Glu Leu Phe Val Val 

445 450 
Glu Lys Lys Leu Phe He His Glu Tyr He Ser Gly Tyr Tyr Arg 
455 4gQ acZ 

Val Ser Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro 
470 475 Aan 

Met Arg Met Leu Pro Ser He He Phe Thr Cys He val Tyr Phe 
485 490 4 qc 

Met Leu Gly Leu Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met 
500 505 c-in 

Phe Thr Leu Met Met Val Ala Tyr Ser Ala Ser Ser Met Ala Leu 

520 505 
Ala He Ala Ala Gly Gin Ser Val Val Ser Val Ala Thr Leu Leu 

530 535 
Met Thr lie Cys Phe Val Phe Met Met He Phe Ser Gly Leu Leu 
545 550 ccc 

Val Asn Leu Thr Thr He Ala Ser Trp Leu Ser Trp Leu Gin Tyr 

565 570 
Phe Ser He Pro Arg Tyr Gly Phe Thr Ala Leu Gin His Asn Glu 

Phe Leu Gly Gin Asn Phe Cys Pro Gly l" Asn Ala Thr Gly Isn 
590 595 J finn 

Asn Pro Cys Asn Tyr Ala Thr Cys Thr Gly Glu Glu Tyr Leu Val 

610 fii5 

Lys Gin Gly He Asp Leu Ser Pro Trp Gly Leu Trp Lys Asn His 

625 63n 

Val Ala Leu Ala Cys Met He Val He Phe Leu Thr lie Ala Tyr 
635 640 gjc 

Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser 
650 655 

<210> 25 ^ 

<211> 184 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 5593114CD1 

<400> 25 

Met Trp Val Phe Gly Tyr Gly Ser Leu lie Trp Lys Val Asp Phe 
Pro Tyr Gin Asp Lys Leu Val Gly Tyr iJe Thr Asn Tyr Ser Ar^ 

Arg Phe Trp Gin Gly Ser Thr Asp His Arg Gly Val Pro Gly Lys" 
3 5 40 ac 

Pro Gly Arg Val Val Thr Leu Val Glu Asp Pro Ala Gly Cys Val 
50 55 , n 

Trp Gly Val Ala Tyr Arg Leu Pro Val Gly Lys Glu Glu Glu Val 
65 70 75 
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Arg 


Glu 


Lys Gly Gly Tyr Arg Thr Thr 


Lys 




85 




90 


Asp 


Pro Thr Thr Lys 


Pro 


Phe Ser 


Thr 




100 




1 OR 


Cys 


Asp Asn Pro Asp 


Tyr 


Leu Gly 


lie 


Ala 


115 




120 


Glu Gin He Phe 


Asn 


Ala Ala 


Thr 




130 




135 


Glu 


Tyr Leu Phe Glu 


Leu 


Ala Asn 


Pro 


Glu 


145 




150 


Glu Ala Asp Glu 


His 


Leu Phe 






160 




165 


Lys Glu Arg Leu Glu Gly 


Lys 


Gin Asn 






175 


180 



80 

Thr Val He Phe Tyr I 
95 

Val Leu Leu Tyr He C 
110 

Pro Ala Pro Leu Glu / 
125 

Gly Pro Ser Gly Arg P. 

140 

Ser He Arg Asn Leu \ 
155 

Ala Leu Glu Lys Leu \ 
170 

Leu Asn Cys He 

<210> 26 

<211> 154 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 044775CD1 

<400> 26 

Met Gly Ala Phe Glu Cys Val Arg Lys Val Tyr Gin Thr Asp Gly 

5 10 15 

Leu Lys Gly Phe Tyr Arg Gly Met Ser Ala Ser Tyr Ala Gly He 

Ser Glu Thr Val lie His Phe Val He T^r Glu Ser lie Lys Gin" 
35 40 45 

Lys Leu Leu Glu Tyr Lys Thr Ala Ser Thr Met Glu Asn Asp Glu 

Glu Ser Val Lys Glu Ala Ser Asp Phe Val Gly Met Met Leu Ala 

65 7q -c 

Ala Ala Thr Ser Lys Thr Cys Ala Thr Thr He Ala Tyr Pro His 
80 gtj 

Glu Val Val Arg Thr Arg Leu Arg Glu Glu Gly Thr Lys Tyr Arg 

Ser Phe Phe Gin Thr Leu Ser Leu Leu Val Gin Glu Glu Gly Tyr 

115 120 
Gly Ser Leu Tyr Arg Gly Leu Thr Thr His Leu Val Arg Gin He 
125 130 t->c 

Pro Asn Thr Ala He Met Met Ala Thr Tyr Glu Leu Val Val Tyr 

140 145 i5n 

Leu Leu Asn Gly 150 

<210> 27 
<211> 438 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 116588CD1 



<400> 27 



Met Leu Leu Val Thr Pro Arg Pro Glu Arg Gly Gly Arg Gly Thr 
Glu Leu Gly Glu Phe Cys Gly Thr Pro Le2 Leu Phe Ser Ser T^r 



Phe Cys Tyr Asp Asn Pro Ala Ala Leu Gin Thr Gin Val Lys Arg 
35 4Q 4 ? 

Asp Met Gin Val Asn Thr Thr Lys Phe Met Leu Leu Tyr Ala Trp 
_ 50 55 c Q 

Tyr Ser Trp Pro Asn Val Val Leu Cys Phe Phe Gly Gly Phe Leu 
65 70 75 



23/60 



WO 00/78953 PCT/USOO/16668 

He Asp Arg Val Phe Gly He Arg Trp Gly Thr He He Phe Ser 
_ , 80 85 90 

Cys Phe Val Cys lie Gly Gin Val Val Phe Ala Leu Gly Gly He 

95 100 105 

Phe Asn Ala Phe Trp Leu Met Glu Phe Gly Arg Phe Val Phe Glv 

110 H5 120 

He Gly Gly Glu Ser Leu Ala Val Ala Gin Asn Thr Tyr Ala Val 

125 130 135 

Ser Trp Phe Lys Gly Lys Glu Leu Asn Leu Val Phe Gly Leu Gin 

140 i45 150 

Leu Ser Met Ala Arg lie Gly Ser Thr Val Asn Met Asn Leu Met 

155 160 165 

Gly Trp Leu Tyr Ser Lys He Glu Ala Leu Leu Gly Ser Ala Gly 

170 175 iao 

His Thr Thr Leu Gly He Thr Leu Met lie Gly Gly Val Thr Cys 
_ . 185 190 195 

lie Leu Ser Leu lie Cys Ala Leu Ala Leu Ala Tyr Leu Asp Gin 

200 205 210 

Arg Ala Glu Arg lie Leu His Lys Glu Gin Gly Lys Thr Gly Glu 

tr i , 215 220 225 

Val lie Lys Leu Thr Asp Val Lys Asp Phe Ser Leu Pro Leu Trp 

230 235 240 

Leu He Phe He lie Cys Val Cys Tyr Tyr Val Ala Val Phe Pro 

245 250 255 

Phe lie Gly Leu Gly Lys Val Phe Phe Thr Glu Lys Phe Gly Phe 

260 265 270 

Ser Ser Gin Ala Ala Ser Ala He Asn Ser Val Val Tyr Val He 

275 280 285 

Ser Ala Pro Met Ser Pro Val Phe Gly Leu Leu Val Asp Lys Thr 

290 295 300 

Gly Lys Asn lie lie Trp Val Leu Cys Ala Val Ala Ala Thr Leu 
,r n „ 305 310 315 

Val Ser His Met Met Leu Ala Phe Thr Met Trp Asn Pro Trp lie 

320 325 330 

Ala Met Cys Leu Leu Gly Leu Ser Tyr Ser Leu Leu Ala Cys Ala 

335 3 40 345 

Leu Trp Pro Met Val Ala Phe Val Val Pro Glu His Gin Leu Gly 

350 355 3 gi 

Thr Ala Tyr Gly Phe Met Gin Ser lie Gin Asn Leu Gly Leu Ala 

365 370 375 

lie lie Ser He lie Ala Gly Met lie Leu Asp Ser Arg Gly Tyr 

380 385 290 

Leu Phe Leu Glu Val Phe Phe lie Ala Cys Val Ser Leu Ser Leu 
_ . 395 400 405 

Leu Ser Val Val Leu Leu Tyr Leu Val Asn Arg Ala Gin Gly Gly 

41° 415 420 

Asn Leu Asn Tyr Ser Ala Arg Gin Arg Glu Glu lie Lys Phe Ser 

«■ mi. , 425 430 435 

His Thr Glu 3 

<210> 28 

<211> 237 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 875369CD1 

<400> 28 

Met Ala His Val Gly Ser Arg Lys Arg Ser Arg Ser Arg Ser Arg 

Ser Arg Gly Arg Gly Ser Glu Lys Arg Lys Lys Lys Ser Arg Lys 
20 25 30 

Asp Thr Ser Arg Asn Cys Ser Ala Ser Thr Ser Gin Gly Arg Lys 
35 40 45 

Ala Ser Thr Ala Pro Gly Ala Glu Ala Ser Pro Ser Pro Cys He 
50 55 eo 
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65 

Ser Ser Ser Ser Ser Ser ■ 
80 

Ser Ser Ser Ser Ser Ser I 
95 

Lys Tyr Lys Asp Lys Arg J 
110 

Leu Lys Lys Lys Gly Lys C 
125 

Ala Leu Pro Gly Pro Ser I 
140 

Glu Glu Glu Asp Gly Pro \ 
155 

lie Gin Ala Met Lys Pro 1- 
170 

Gin Ser lie lie Arg Lys \ 
185 

Arg Leu lie Lys Gly Asp C 
200 

Lys Glu Arg His Arg Glu I 
215 

Cys Leu Ala Phe Gin Met A 
230 

<210> 29 
<211> 219 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1325518CD1 



PCI7US00/16668 



Ala 


Arg Arg Arg 


Thr Arg Ser Ser 


Ser 


70 


75 


Ser Ser Ser 


Ser Ser Ser Ser 


Ser 


85 


90 


Asp Gly Arg Lys Lys Arg Gly 


Lys 


100 


105 


Lys Lys Lys 


Lys Arg Lys Lys 




115 


120 


Lys 


Ala Glu Ala 


Gin Gin Val Glu 




130 


135 


Asp Gin Trp His 


Arg Ser Ala Gly 




145 


150 


Leu 


Thr Asp Glu Gin Lys Ser Arg 


Thr 


160 


165 


Lys Glu Glu 


Trp Asp Ala Arg 


Val 


175 


180 


Asp Pro Glu 


Thr Gly Arg Thr 


Glu 


190 


195 


Val Leu Glu 


Glu lie Val Thr 


Asn 


205 


210 


Lys Gin Ala 


Thr Arg Gly Asp 




220 


225 


Ala Gly Leu Leu 


Pro 




235 





<400> 29 

Met Lys Leu Leu Leu Trp Ala Cys lie val Cys Val Ala Phe Ala 

Arg Lys Arg Arg Phe Pro Phe lie Gly c" Asp Asp Asn Asp A Jp 
20 25 -an 

Gly His Pro Leu His Pro Ser Leu Asn He Pro Tyr Gly lie Arg 

40 

Asn Leu Pro Pro Pro Leu Tyr Tyr Arg Pro Val Asn Thr Val Pro 

Ser Tyr Pro Gly Asn Thr Tyr Thr Asp t" Gly Leu Pro Ser Tyi 
65 7Q ijc 

Pro Trp lie Leu Thr Ser Pro Gly Phe Pro Tyr Val Tyr His He 

Arg Gly Phe Pro Leu Ala Thr Gin Leu Asn Val Pro Pro Leu Pro" 
95 100 105 

Ser Arg Phe Phe Ser Ala 

Ala Ala Ala Pro Ala Ala Pro Pro lie III Ala Glu Pro Ala HI 
±? 5 . 130 135 



Pro Arg Gly Phe Pro Phe Val Pro Pro Ser Arg Phe Phe Ser III 



Ala Ala Pro Leu Thr Ala Thr Pro Val Ala Ala Glu Pro Ala HI 
140 145 - cn 

Gly Ala Pro Val Ala Ala Glu Pro Ala Ala Glu Ala Pro Val Gly 

160 i fit; 

Ala Glu Pro Ala Ala Glu Ala Pro Val Ala Ala Glu Pro Ala Ala 

170 i 75 - OA 

Glu Ala Pro Val Gly Val Glu Pro Ala Ala Glu Glu Pro Ser Pro 

185 190 i 

Ala Glu Pro Ala Thr Ala Lys Pro Ala Ala Pro Glu Pro His Pro 

^00 205 oin 

Ser Pro Ser Leu Glu Gin Ala Asn Gin 
215 

<210> 30 
<211> 707 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> roisc_feature 

<223> Incyte ID No: 2060987CD1 

<400> 30 

Met Ala Ala Ala Ala Thr Ala Ala Glu Gly Val Pro Ser Arg Gly 

Pro Pro Gly Glu Val lie His Leu Asn VaJ Gly Gly Lys Arg Pne 

Ser Thr Ser Arg Gin Thr Leu Thr Trp ill Pro Asp Ser Phe Pne 

Ser ser Leu Leu Ser Gly Arg lie Ser Thr Leu Lys Asp Glu Tar 

5Q cc 
Gly Ala He Phe lie Asp Arg Asp Pro Thr Val Phe Ala Pro 

70 "re 
Leu Asn Phe Leu Arg Thr Lys Glu Leu Asp Pro Arg Gly Val His 

Gly Ser Ser Leu Leu His Glu Ala Gin Phe Tyr Gly Leu Thr PrS 

" 100 i nc 

Leu Val Arg Arg Leu Gin Leu Arg Glu Glu Leu Asp Arg Ser III 

Cys Gly Asn Val Leu Phe Asn Gly Tyr HI Pro Pro Pro Val J n ° 
125 130 i c 

Pro Val Lys Arg Arg Asn Arg His Ser Leu Val Gly Pro Gin III 

140 145 -I en 

Leu Gly Gly Arg Pro Ala Pro Val Arg Arg Ser Asn Thr Met III 

160 ice 
Pro Asn Leu Gly Asn Ala Gly Leu Leu Gly Arg Met Leu Asp g" 

X f\j 175 1 oa 

Lys Thr Pro Pro Ser Pro Ser Gly Gin Pro Glu Glu Pro Gly Me? 

185 190 1 ac 

Val Arg Leu Val Cy S Gly His His Asn Trp lie Ala Val Ala 

Thr Gin Phe Leu Val Cys Tyr Arg Leu lyt Glu Ala Ser Gly Tr£ 

Gin Leu Val Phe Ser Ser Pro Arg Leu lsp Trp Pro lie Glu Irg 

Leu Ala Leu Thr Ala Arg Val His Gly J?J Ala Leu Gly Glu hJs 
ff 5 250 
Ala 

260 
Glu 
275 
Val 
290 
His 
305 
Gin 

320 395 ~ J ~ 

Asp Ala Ala Gly Ser Phe Leu Leu Leu Gly Cys Asn Asn Gly III 

He Tyr Tyr Val Asp Val Gin Lys Phe Pro" Leu Arg Met Lys lsp 

JbO 355 ■* J' 

Asn Asp Leu Leu Val Ser Glu Leu Tyr Arg Asp Pro Ala Glu AsS 
Gly Val Thr Ala Leu Ser Val Tyr Leu Thr Pro Lys Thr Ser lsp 
Ser Gly Asn Trp lie Glu lie Ala Tyr III Thr Ser Ser Gly III 



Asp Lys Met Val Ala Ala Ala Thr Gly Ser Glu lie Leu Leu ?£ 



Ala Leu Gin Ala Glu Gly Gly Gly Ser 25 n e Gly Val Phe £2 

280 285 
Phe Val Gly Asn Gin Leu 
295 300 
Gly Val Trp Asn Ala Val 

Thr Lys His Trp Gin Val Gin Glu Val III Pro lie Thr Ser T^r 



Leu Gly Val Pro Val Glu Ala Leu Phe Phe Val Gly Asn Gin III 



He Ala Thr Ser His Thr Gly Arg He Gly Val Trp Asn Ala VaJ 



Val Arg val lie Val Gin His Pro Glu ill Val Gly Ser Gly ill 



410 



Gin Leu Phe Gin Thr Phe Thr Val His Arg Ser Pro Val Thr £! 
He Met Leu Ser Glu Lys His Leu lie til val Cys Ala Asp ill 
Asn His Val Arg Thr Trp Ser Val Thr Arg Phe Arg Gly Met ill 
Ser Thr Gin Pro Gly Ser Thr Pro Leu ill Ser Phe Lys lie til 
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Leu Gly Ser Arg Pro Arg Arg Tyr Leu Leu Thr Gly Gin Ala III 



Pro Thr Thr Ala Phe Thr Val Leu Glu Cys Glu Gly Ser Arg Arg 
545 

Pro Arg Arg Tyr Leu Leu 

560 565 " " ™" 

Gly Ser Leu Ala Met Trp Asp Leu Thr Thr Ala Met Asp Gly Leu 
"5 580 c ft e 

Gly Gin Ala Pro Ala Gly Gly Leu Thr Glu Gin Glu Leu Met Glu 
590 595 , nn 

Gin Leu Glu His Cys Glu Leu Ala Pro Pro Ala Pro Ser Ala Pro 
605 610 cic 

Ser Trp Gly Cys Leu Pro Ser Pro Ser Pro Arg lie Ser Leu Tnr 

Ser Leu His Ser Ala Ser Ser Asn Thr Ser Leu Ser Gly His £J 

640 fi «r 

Gly Ser Pro Ser Pro Pro Gin Ala Glu Ala Arg Arg Arg Gly Gly 

Gly Ser Phe Val Glu Arg Cys Gin Glu til Val Arg Ser Gly III 
665 670 fi 7 c 

LeU I™ Pro Thr Pro Ala Pro Trp Pro Ser Ser Gly 

Leu Gly Thr Pro Leu Thr Pro Pro Lys Met Lys Leu Asn Glu Sr 
Ser Phe 695 700 705 

<210> 31 
<211> 279 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2172064CD1 

<400> 31 

Met Cys Gly Arg Phe Leu Arg Arg Leu Leu Ala Glu Glu Ser Arg 

Arg Ser Thr Pro Val Gly Arg Leu Leu LeS Pro Val Leu Leu G Jy 

Phe Arg Leu Val Leu Leu Ala Ala Ser G^y Pro Gly Val Tyr Gly 
35 40 A i 

Asp Glu Gin Ser Glu Phe Val Cys His Thr Gin Gin Pro Gly Cys 
50 55 J £ n 

Lys Ala Ala Cys Phe Asp Ala Phe His Pro Leu Ser Pro Leu Arg 



Ser Trp Val Phe Gin Val lie Leu Val A Ia Val Pro Ser Ala Leu 

Tyr Met Gly Phe Thr Leu Tyr His Val He Trp His Trp Glu Leu" 

Ser Gly Lys Gly Lys Glu Glu Glu Thr JeS lie Gin Gly Arg g!u 

115 120 



110 



Gly Asn Thr Asp Val Pro Gly Ala Gly Ser Leu Arg Leu Leu Trp 



125 130 135 



Ala Tyr Val Ala Gin Leu Gly Ala Arg Leu Val Leu Glu Gly All 



Ala Leu Gly Leu Gin Tyr His Leu Tyr G Jy Phe Gin Met Pro Ser 



155 



Ser Phe Ala Cys Arg Arg Glu Pro Cys tin Gly Ser He Thr Cys 



170 175 - Ho 



Asn Leu Ser Arg Pro Ser Glu Lys Thr lie Phe Leu Lys Thr Met 
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185 190 195 

Phe Gly Val Ser Gly Phe Cys Leu Leu Phe Thr Phe Leu Glu Leu 

200 205 210 

Val Leu Leu Gly Leu Gly Arg Trp Trp Arg Thr Trp Lys His Lys 

215 220 225 

Ser Ser Ser Ser Lys Tyr Phe Leu Thr Ser Glu Ser Thr Arg Arg 

230 235 240 

His Lys Lys Ala Thr Asp Ser Leu Pro Val Val Glu Thr Lys Glu 

245 250 255 

Gin Phe Gin Glu Ala Val Pro Gly Arg Ser Leu Ala Gin Glu Lys 

260 265 270 

Gin Arg Pro Val Gly Pro Arg Asp Ala 

275 

<210> 32 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2219267CD1 



<400> 32 



Met Val Thr Gly 


Leu 


Ala 


1 


5 




Tyr Leu Pro Gly 


Trp Thr 




20 




Leu Ala Ser Arg 


Ala 


Val 




35 




Glu Val Phe Pro 


Thr 


Val 




50 




Leu Gly Ala Gly 


Phe 


Leu 




65 




Leu His Gly Arg 


Gin 


Gly 




80 




Ser Leu Ala Val 


Leu 


Ala 




95 




Ser Arg Ser Arg 


Gly Leu 




110 




Leu Arg Arg Ser 


Pro 


Leu 




125 




Leu Pro Leu Leu 


Pro 


Pro 




140 




Pro Glu Gin His 







10 is 
Leu Ser Val Leu Gly Leu 

25 30 
Ser Ser Leu Phe Ala Ala 

40 45 
Ala Gly Leu Gly Leu Val 

55 60 
Ala Gly Pro Leu Asp Thr 

70 75 
Gin Gin Val Val Phe Ala 

85 90 
Val Leu Leu Leu Pro Glu 
100 105 
Leu Gin Asp Ala Asp Arg 
115 120 
Arg Pro Arg Gin Asp His 
130 135 
Tyr Trp Ala Gly His Thr 
145 150 



<210> 33 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2308629CD1 

<400> 33 



Met 
1 


Val Ala 


Gly 


Ala Val Ala 


Tyr 


Pro He 


Asp 


5 

Cys Val Lys 








20 


Asp 


Pro Ala 


Ala 


Arg Tyr Arg 








35 


He 


He Arg 


Thr 


Glu Gly Leu 








50 


Val 


Thr Ala 


Thr 


Gly Ala Gly 








65 


Cys 


Tyr Glu 


Lys 


Leu Lys Lys 








80 


Gly Gly Asn 


Ser 


His He Ala 



10 15 
Met Gin Ser Leu Gin Pro 

25 30 
Leu Glu Ala Leu Trp Arg 

40 45 
Pro Met Arg Gly Leu Asn 

55 60 
Sis Ala Leu Tyr Phe Ala 

70 75 
ser Asp Val He His Pro 

85 90 
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Met 




100 




105 


Asn 


Pro 


Ala Glu Val Val 


Lys 






115 




120 


Ser 


Pro 


Tyr 


His Arg Val Thr 


Asp 






130 




135 


Asn Glu Gly Ala Gly Ala Phe 


Tyr 


Thr 




145 




150 


Met 


Asn 


Val Pro Phe Gin 


Ala 


cuts 




160 




165 


Leu 


Gin 


Glu His Phe Asn 


Pro 




His 


175 




180 


Ser 


Val 


Leu Ser Gly Ala 


Cys 


Ala 




190 




195 


Thr 


Thr 


Pro Leu Asp Val 


Cys 


Glu 




205 




210 


Ser 


Leu 


Ala Leu Asn Ser 


His 






220 




225 


Met 


Ala 


Ser 


Ala Phe Arg Thr 


Val 


Ala 




235 


240 


Tyr Phe Arg Gly Val Gin Ala 






250 




255 


Ser 


Thr 


Ala 


He Ala Trp Ser 


Val 


lie 




265 


270 


Thr 


Lys 


Arg Gin Glu Glu 


Trp 






280 




285 



95 

Thr Leu Leu His Asp Ala 1 
110 

Gin Arg Met Gin Met Tyr i 
125 

Cys Val Arg Ala Val Trp C 
140 

Arg Ser Tyr Thr Thr Gin I 
155 

He His Phe Met Thr Tyr C 
170 

Gin Arg Arg Tyr Asn Pro £ 
185 

Ala Gly Ala Val Ala Ala J 
200 

Lys Thr Leu Leu Asn Thr C 
215 

He Thr Gly His He Thr G 
230 

Tyr Gin Val Gly Gly Val 1 
245 

Arg Val He Tyr Gin He F 
260 

Tyr Glu Phe Phe Lys Tyr L 
275 

Arg Ala Gly Lys 

<210> 34 
<211> 300 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2 660038CD1 

<400> 34 

Met Asp Phe Leu Met Ser Gly Leu Ala Ala Cys Gly Ala Cys Val 

Phe Thr Asn Pro Leu Glu Val Val Lys Thr Arg Met Gin Leu Gin 

20 25 30 

Gly Glu Leu Gin Ala Pro Gly Thr Tyr Gin Arg His Oyr Arg Asn 
tr i ^ 35 40 45 

Val Phe His Ala Phe He Thr He Gly Lys Val Asp Gly Leu Ala 

50 55 g Q 

Ala Leu Gin Lys Gly Leu Ala Pro Ala Leu Leu Tyr Gin Phe Leu 

65 70 75 

Met Asn Gly He Arg Leu Gly Thr Tyr Gly Leu Ala Glu Ala Gly 

80 85 oh 

Gly Tyr Leu His Thr Ala Glu Ala Thr His Ser Pro Ala Arg Ser 

Ala Ala Ala Gly Ala Met Ala Gly Val Met Gly Ala Tyr Leu Gly 

110 us 12 n 

Ser Pro lie Tyr Met Val Lys Thr His Leu Gin Ala Gin Ala Ala 

125 13Q 225 

Ser Glu He Ala Val Gly His Gin Tyr Lys His Gin Gly Met Phe 

140 145 150 

Gin Ala Leu Thr Glu He Gly Gin Lys His Gly Leu Val Gly Leu 

155 igo 165 

Trp Arg Gly Ala Leu Gly Gly Leu Pro Arg Val He Val Gly Ser 

170 175 180 

Ser Thr Gin Leu Cys Thr Phe Ser Ser Thr Lys Asp Leu Leu Ser 

185 190 iQc 

Gin Trp Glu lie Phe Pro Pro Gin Ser Trp Lys Leu Ala Leu Val 

200 205 210 

Ala Ala Met Met Ser Gly He Ala Val Val Leu Ala Met Ala Pro 
„w 215 220 225 

Phe Asp Val Ala Cys Thr Arg Leu Tyr Asn Gin Pro Thr Asp Ala 
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230 235 240 

Gin Gly Lys Gly Leu Met Tyr Arg Gly lie Leu Asp Ala Leu Leu 

45 25 0 255 

Gin Thr Ala Arg Thr Glu Gly He Phe Gly Met Tyr Lys Gly He 

260 265 270 

Gly Ala Ser Tyr Phe Arg Leu Gly Pro His Thr He Leu Ser Leu 
n . _ 275 280 285 

Phe Phe Trp Asp Gin Leu Arg Ser Leu Tyr Tyr Thr Asp Thr Lys 

290 295 300 

<210> 35 

<211> 382 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2670745CD1 

<400> 35 

Met Leu Arg Trp Thr Val His Leu Glu Gly Gly Pro Arg Arg Val 

Asn His Ala Ala Val Ala Val Gly His Arg Val Tyr Ser Phe Gly 

2° 25 30 

Gly Tyr Cys Ser Gly Glu Asp Tyr Glu Thr Leu Arg Gin He Asp 

35 40 45 

Val His He Phe Asn Ala Val Ser Leu Arg Trp Thr Lys Leu Pro 

50 55 g0 

Pro Val Lys Ser Ala He Arg Gly Gin Ala Pro Val Val Pro Tyr 

65 70 75 

Met Arg Tyr Gly His Ser Thr Val Leu lie Asp Asp Thr Val Leu 

80 85 90 

Leu Trp Gly Gly Arg Asn Asp Thr Glu Gly Ala Cys Asn Val Leu 

Tyr Ala Phe Asp Val Asn Thr His Lys Trp" Phe Thr Pro Arg Val 
HO 115 120 

Ser Gly Thr Val Pro Gly Ala Arg Asp Gly His Ser Ala Cys Val 
125 130 135 

Leu Gly Lys He Met Tyr He Phe Gly Gly Tyr Glu Gin Gin Ala 

Asp Cys Phe Ser Asn Asp He His Lys Leu Asp Thr Ser Thr ilt 

155 2.6O i fit; 

Thr Trp Thr Leu He Cys Thr Lys Gly Ser Pro Ala Arg Trp Arg 

170 i qq 

Asp Phe His Ser Ala Thr Met Leu Gly Ser His Met Tyr Val Phe 

1®5 190 295 

Gly Gly Arg Ala Asp Arg Phe Gly Pro Phe His Ser Asn Asn Glu 

200 205 210 

He Tyr Cys Asn Arg He Arg Val Phe Asp Thr Arg Thr Glu Ala 
_ 215 220 225 

Trp Leu Asp Cys Pro Pro Thr Pro Val Leu Pro Glu Gly Arg Arg 

230 235 240 

Ser His Ser Ala Phe Gly Tyr Asn Gly Glu Leu Tyr lie Phe Gly 

245 250 255 

Gly Tyr Asn Ala Arg Leu Asn Arg His Phe His Asp Leu Trp Lys 

2°0 265 270 

Phe Asn Pro Val Ser Phe Thr Trp Lys Lys He Glu Pro Lys Gly 
* 280 285 

Lys Gly Pro Cys Pro Arg Arg Arg Gin Cys Cys Cys He Val Gly 

295 3Q0 
Asp Lys He Val Leu Phe Gly Gly Thr Ser Pro Ser Pro Glu Glu 

305 310 315 

Gly Leu Gly Asp Glu Phe Asp Leu He Asp His Ser Asp Leu His 

320 325 33Q 

He Leu Asp Phe Ser Pro Ser Leu Lys Thr Leu Cys Lys Leu Ala 

335 340 345 

Val He Gin Tyr Asn Leu Asp Gin Ser Cys Leu Pro His Asp He 

350 355 360 
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Arg Trp Glu Leu Asn Ala Met Thr Thr Asn Ser Asn lie Ser Ara 
365 370 -lit 

Pro He Val Ser Ser His Gly 
380 

<210> 36 
<211> 287 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2676443CD1 

<400> 36 

Met Ala Ala Glu Ala Arg Val Ser Arg Trp Tyr Phe Gly Gly Leu 

Ala Ser Cys Gly Ala Ala Cys Cys Thr His Pro Leu Asp Leu Leu 

20 25 30 

Lys Val His Leu Gin Thr Gin Gin Glu Val Lys Leu Arg Met Thr 

35 40 45 

Gly Met Ala Leu Arg Val Val Arg Thr Asp Gly He Leu Ala Leu 

55 60 
Tyr Ser Gly Leu Ser Ala Ser Leu Cys Arg Gin Met Thr Tyr Ser 

65 70 75 

Leu Thr Arg Phe Ala lie Tyr Glu Thr Val Arg Asp Arg Val Ala 

80 Q5 q 0 

Lys Gly Ser Gin Gly Pro Leu Pro Phe His Glu Lys Val Leu Leu 

Gly Ser Val Ser Gly Leu Ala Gly Gly Phe Val Gly Thr Pro All 

110 H5 120 

Asp Leu Val Asn Val Arg Met Gin Asn Asp Val Lys Leu Pro Gin 

125 130 jjc 

Gly Gin Arg Arg Asn Tyr Ala His Ala Leu Asp Gly Leu Tyr Arg 

140 145 15 q 

Val Ala Arg Glu Glu Gly Leu Arg Arg Leu Phe Ser Gly Ala Thr 

155 iso 16= 

Met Ala Ser Ser Arg Gly Ala Leu Val Thr Val Gly Gin Leu Ser 

175 180 
Cys Tyr Asp Gin Ala Lys Gin Leu Val Leu Ser Thr Gly Tyr Leu 

I*"* 190 195 

Ser Asp Asn He Phe Thr His Phe Val Ala Ser Phe He Ala Gly 

200 205 210 

Gly Cys Ala Thr Phe Leu Cys Gin Pro Leu Asp Val Leu Lys Thr 

215 220 225 

Arg Leu Met Asn Ser Lys Gly Glu Tyr Gin Gly Val Phe His Cys 

Ala Val Glu Thr Ala Lys Leu Gly Pro Leu Ala Phe Tyr Lys Gly 
245 250 255 

Leu Val Pro Ala Gly He Arg Leu He Pro His Thr Val Leu Thr 

260 265 270 

Phe Val Phe Leu Glu Gin Leu Arg Lys Asn Phe Gly lie Lys Val 

275 280 pflc: 

Pro Ser J85 



<210> 37 
<211> 497 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3295764CD1 



<400> 37 
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Phe Leu Pro Thr Ala Leu Leu Cys Ala Tyr Gly Phe Phe Ala Ser 
35 40 45 

Leu Arg Pro Ser Glu Pro Phe Leu Thr Pro Tyr Leu Leu Gly Pro 
50 55 6Q 

Asp Lys Asn Leu Thr Glu Arg Glu Val Phe Asn Glu He Tyr Pro 
65 70 J 7 - 

Val Trp Thr Tyr Ser Tyr Leu Val Leu Leu Phe Pro Val Phe Leu 

80 QC^ -» 

Ala Thr Asp Tyr Leu Arg Tyr Lys Pro Val Val Leu Leu Gin Gly 
95 100 in"; 

Leu Ser Leu lie Val Thr Trp Phe Met Leu Leu Tyr Ala Gin Gly 

HO 115 , 2 q 

Leu Leu Ala He Gin Phe Leu Glu Phe Phe Tyr Gly He Ala Thr 

125 130 135 

Ala Thr Glu He Ala Tyr Tyr Ser Tyr He Tyr Ser Val Val Asp 

140 145 150 

Leu Gly Met Tyr Gin Lys Val Thr Ser Tyr Cys Arg Ser Ala Thr 

15 5 160 1S5 

Leu Val Gly Phe Thr Val Gly Ser Val Leu Gly Gin He Leu Val 

175 180 
Ser Val Ala Gly Trp Ser Leu Phe Ser Leu Asn Val He Ser Leu 

185 i9Q 1 q( . 

Thr Cys Val Ser Val Ala Phe Ala Val Ala Trp Phe Leu Pro Met 

200 205 210 

Pro Gin Lys Ser Leu Phe Phe His His He Pro Ser Thr Cys Gin 

Arg Val Asn Gly lie Lys Val Gin Asn lly Gly He Val Thr III 

230 235 240 

Thr Pro Ala Ser Asn His Leu Pro Gly Trp Glu Asp He Glu Ser 

245 250 255 

Lys He Pro Leu Asn Met Glu Glu Pro Pro Val Glu Glu Pro Glu 

260 265 270 

Pro Lys Pro Asp Arg Leu Leu Val Leu Lys Val Leu Trp Asn Asp 

2'5 280 pa? 

Phe Leu Met Cys Tyr Ser Ser Arg Pro Leu Leu Cys Trp Ser Val 

250 295 300 

Trp Trp Ala Leu Ser Thr Cys Gly Tyr Phe Gin Val Val Asn Tyr 

3 °5 3iq 315 

Thr Gin Gly Leu Trp Glu Lys Val Met Pro Ser Arg Tyr Ala Ala 
0 325 330 

He Tyr Asn Gly Gly Val Glu Ala Val Ser Thr Leu Leu Gly Ala 

335 340 34c 

Val Ala Val Phe Ala Val Gly Tyr He Lys He Ser Trp Ser Thr 

350 355 , fin 

Trp Gly Glu Met Thr Leu Ser Leu Phe Ser Leu Leu He Ala Ala 

365 370 , 7 e 

Ala Val Tyr He Met Asp Thr Val Gly Asn He Trp Val Cys Tyr 

Ala Ser Tyr Val Val Phe Arg He He Tyr Met Leu Leu He Thr 

lie Ala Thr Phe Gin He Ala Ala Asn tlu Ser Met Glu Arg Tyr 

410 415 
Ala Leu Val Phe Gly Val Asn Thr Phe lie Ala Leu Ala Leu Gin 

425 430 4 ^t; 

Thr Leu Leu Thr Leu He Val Val Asp Ala Ser Gly Leu Gly Leu 

44 0 445 4 c 0 

Glu He Thr Thr Gin Phe Leu He Tyr Ala Ser Tyr Phe Ala Leu 

He Ala Val Val Phe Leu Ala Ser Gly Ala Val Ser Val Met "J 
0 475 480 

Lys Cys Arg Lys Leu Glu Asp Pro Gin Ser Ser Ser Gin Val Thr 

Thr ser 485 490 «5 

<210> 38 
<211> 228 
<212> PRT 

<213> Homo sapiens 



32/60 



WO 00/78953 PCT/US00/16668 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3438320CD1 

<400> 38 

Met Pro Arg Arg Gly Leu Val Ala Gly Pro Asp Leu Glu Tyr Phe 

Gin Arg Arg Tyr Phe Thx Pro Ala Glu Val Ala Gin His Asn Ar§ 

Pro Glu Asp Leu Trp Val Ser Tyr Leu G^y Arg Val Tyr Asp Leu 

jo 40 A c 

Thr Ser Leu Ala Gin Glu Tyr Lys Gly Asn Leu Leu Leu Lys Pro 

lie Val Glu Val Ala Gly Gin Asp He s" His Trp Phe Asp Pro 

65 70 7C . 

Lys Thr Arg Asp lie Arg Lys His lie Asp Pro Leu Thr Gly Cys 



Leu Arg Tyr Cys Tte Pro Arg Gly Arg Pne Val His Val Pro Pro 
95 100 105 



Gin Leu Pro Cys Ser Asp Trp Ala Asn Asp Phe Gly Lys Pro Trp 

Tyr Glu Val Gly _ 

Arg ser He Arg lie lie Asn Thr Leu Tnr Ser Gin Glu His Tte 



J -* LU 115 1 on 

Trp Gin Gly Ser Tyr Tyr Glu Val Gly Arg Leu Ser Ala Lys Thr 



Leu Glu Val Gly Val Leu Glu Ser lie Trp Glu lie Leu His Arg 
155 2.60 i fi? 

Tyr Leu Pro Tyr Asn Ser His Ala Ala Ser Tyr Thr Trp Lys Tyr 

Glu Gly Lys Asn Leu Asn Met Asp Phe Thr Leu Glu Glu Asn lly 

" 190 1 QC 

lie Arg Asp Glu Glu Glu Glu Phe Asp Tyr Leu Ser Met Asp Gly 
Thr Leu His Thr Pro Ala lie Leu Leu Tyr Phe Asn Asp Asp Leu 

Thr Glu Leu 225 

<210> 39 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3 986488CD1 

<400> 39 

Met Ala Ala Thr lie Met He Leu Tyr Val Ser Lys Leu Asn Lys 

He He His Phe Pro Asp Phe Asp Lys L^s He Pro Val Lys Le5 

Phe Pro Leu Pro Leu Leu Tyr Val Gly A^n His He Ser Gly llu 
35 40 A c 

Ser Ser Thr Ser Lys Leu Ser Leu Pro Met Phe Thr Val Leu Arg 

Lys Phe Thr lie Pro Leu Thr Leu Leu Leu Glu Thr He lie Leu 

Gly Lys Gin Tyr Ser Leu Asn He He Leu Ser Val Phe Ala lie 

80 n - 



He Leu Gly Ala Phe He Ala Ala Gly Ser Asp Leu Ala Phe Asn 

100 105 
Asn Asp He Phe Thr Ala 
115 120 
Met Asp Pro Lys Glu Leu 
130 135 
Ala Cys Phe Met He He 

Pro Thr Leu He He Ser Val Ser Thr cJy Asp Leu Gin Gin S 



95 * 100 " i"05 

Phe Val Phe Leu 

_ Thr Lys Gin Lys 
125 230 " " 17? 

Gly Lys Tyr Gly Val Leu Phe Tyr Asn Ala Cys Phe Met He He 



Leu Glu Gly Tyr lie Phe Val Phe Leu Asn Asp He Phe Thr III 
Ala Asn Gly Val Tyr Thr Lys Gin Lys Met Asp Pro Lys Glu Jeu 



33/60 



WO 00/78953 



PCT/US00/16668 



155 165 
Trp Lys Asn Val Val Phe He Leu Gin 

Leu Leu Ser Cys Phe Leu Gly Phe Leu til Met Tyr Ser Thr HI 
185 190 1 qc 

Leu Cys Ser Tyr Tyr Asn Ser Ala Leu Thr Thr Ala Val Val Gly 
2nn 205 

Ser Val Ala Tyr lie Gly He Leu He „^ 
215 220 225 

Phe 

??° 235 240 

Ser 
255 

Asp 

Leu Lys Ser 265 2 ?0 



Thr Glu Phe Asn Gin Trp Lys Asn Val Val Phe lie Leu Gin Phe 

Phe 
185 
Tyr 

200 205 oin 

Ala lie Lys Asn Val Ser Val Ala Tyr lie Gly He Leu lie SJ 

^lb 220 
Gly Asp Tyr lie Phe Ser Leu Leu Asn Phe Val Gly Leu Asn I 

tSv 235 **Af\ 

Cys Met Ala Gly Gly Leu Arg Tyr Ser Phe Leu Thr Leu Ser Ser 

250 *5i;r 
Gin Leu Lys Pro Lys Pro Val Gly Glu Glu Asn lie Cys Leu III 



<210> 40 

<211> 206 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4378816CD1 

<400> 40 

Met Gly Ala Glu Trp Glu Leu Gly Ala Glu Ala Gly Gly Ser Leu 

Leu. Leu Cys Ala Ala Leu Leu Ala Ala G }y Cys Ala Leu Gly Leu 
20 25 30 

„sp 

35 . 40 45 

. - _ , - TO 

Phe Val Tyr Leu Ser Leu Val Gly Asn v2 Ala Asn Ser Asp sly 



Arg Leu Gly Arg Gly Gin Gly Ala Ala Asp Arg Gly Ala Leu He 

40 AK 

Trp Leu Cys Tyr Asp Ala Leu Val His Phe Ala Leu Glu Gly Pro 



. 65 7 0 75 

J 80 ^ ^ *** GlY LyS Ala ^ Ala 

Trp Val Tyr Phe Asp Pro Thr lie Val Ser Val Glu He Leu tS 



80 

ksp 

J 5 _ ■ . 100 los 

Phe Leu He Tyr Ala He 

US 120 
-Jr „ ^ lyr „ v H1S pfte Leu Qln Ile ^ 

130 

Val Cys Glu Leu Tyr Gly Cys Trp Met Thr Phe Leu Pro Glu Trp 



Val Ala Leu Asp Gly Ser Leu Ala Leu Phe Leu He Tyr Ala III 
Tyr Arg His Phe 
Gly Cys Trp Met 
Leu Thr Arg ser Pro Asn Leu Asn Thr III Asn Trp Leu Tyr Jys 

160 . i-r 

Trp Leu Tyr Leu Phe Phe Phe Asn Gly Val Trp Val Leu lie Pro 

175 

Gly Leu Leu Leu Trp Gin Ser Trp Leu Glu Leu Lys Lys Met HI 

Gin Lys Glu Thr Ser Ser Val Lys Lys Pne Gin 195 

200 20S 
<210> 41 ^ 
<211> 235 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4797137CD1 

<400> 41 

Met Gin Gin Arg Gly Ala Ala Gly Ser Arg Gly Cys Ala Leu Phe 
Pro Leu Leu Gly Val Leu Phe Phe Gin G " Val Tyr He Val Pne 
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20 



Ser Leu Glu lie A^g Ala Asp Ala His Val Arg Gly Tyr Val Gly 



Glu Lys He Lys Leu Lys Cys Thr Phe Lys Ser Thr Ser Asp Val 



Thr Asp Lys Leu Tta He Asp Trp Thr Tyr Arg Pro Pro Ser Ser 
65 70 7c 

Ser His Thr Val Ser He Phe His Tyr Gin Ser Phe Gin Tyr Pro 

85 on 
Thr Thr Ala Gly Thr Phe Arg Asp Arg lie Ser Trp Val Gly Asn 

Val Tyr Lys Gly Asp Ala Ser lie Ser III Ser Asn Pro Thr III 

110 115 1 on 

Lys Asp Asn Gly Thr Phe Ser Cys Ala Val Lys Asn Pro Pro Asp 

125 - £T 

Val His His Asn lie Pro Met Thr Glu Leu Thr Val Thr Glu Arg 

140 145 - _ = 

Gly Phe Gly Thr Met Leu Ser Ser Val Ala Leu Leu Ser lie Leu" 

155 i cc 

Val Phe val Pro Ser Ala Val Val Val Ala Leu Leu Leu Val Arg 

1 /0 175 i qa 

Met Gly Arg Lys Ala Ala Gly Leu Lys Lys Arg Ser Arg Ser Gly 

185 igo n qc 

Tyr Lys Lys Ser Ser lie Glu Val Ser Asp Asp Thr Asp Gin Gin 
-cUU 205 "3in 

Glu Glu Glu Ala Cys Met Ala Arg Leu Cys Val Arg Cys Ala 111 

^ , 215 220 One 

Cys Leu Asp Ser Asp Tyr Glu Glu Thr Tyr 

230 235 
<210> 42 D 

<211> 147 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5470806CD1 

<400> 42 

Met Ala Ser Leu Arg Leu Phe Leu Leu Cys Leu Ala Val Leu He 
Phe Ala Ser Glu Ala Gly Pro Gly Gly A^a Gly Glu Ser Lys cjs 
Pro Leu Met Val Lys Val Leu Asp Ala Va! Arg Gly Ser Pro A^a 



Val Asp Val Ala Val Lys Val Phe Lys Lys Thr Ala Asp Gly Ser 

Trp Glu Pro Phe Ala Ser Gly Lys Thr A^a Glu Ser Gly Glu Leu 
65 70 7t - 

H lS Gly Leu Thr Thr Asp Glu Lys Phe Thr Glu Gly Val Tyr Arg 

85 90 



80 



Val Glu Leu Asp Thr Lys Ser Tyr Trp Lys Ala Leu Gly lie Ser 



Gly His Arg His T^r Thr He Ala Ala HI Leu Ser Pro Tyr Ser 



Pro Phe His Glu Tyr Ala Glu Val Val Phe Thr Ala Asn Asp III 

Tyr 

125 130 " ~ 

Tyr Ser Thr Thr Ala Val Val Ser Asn Pro Gin Asn 

140 i 4 c 
<210> 43 14b 
<211> 147 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5473242CD1 
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<400> 43 

Met Val His Leu Thr Asp Ala 

1 5 
Trp Gly Lys Val Asn Pro Val 
20 

Ser Leu Leu lie Val Tyr Pro 
35 

Phe Gly Asp Leu Ser Ser Val 
50 

Val Lys Ala His Gly Glu Lys 
65 

Leu Lys His Leu Asp Asn Leu 
80 

Glu Leu His Cys Asp Lys Leu 
95 

Leu Leu Gly Asn Met He Val 
110 

Lys Glu Phe Thr Pro Ser Ala 
125 

Ala Gly Val Ala Ser Ala Leu 
140 

<210> 44 
<211> 2701 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 264114CB1 



PCT/US00/16668 



Glu 


Lys 


Ala 


Thr 


Val 


Asn Gly 


- 

i->eu 


Glu 




10 










1 5 
xj 


He 


Gly 
25 


Ala 


Glu 


Ser 


Leu 


Ala 


Trp Thr 


Gin 


Arg 


Tyr 


Phe 


Ser 


Lvs 


Ser 




40 










45 

"a J 


Ala 


He 


Met Gly 


Asn 


Pro 


Gin 


Val 




55 










60 


He 


Asn 


Ala 


Phe 


Asp Asp 


Gly 






70 










/ 3 


Lys Gly 


Thr 


Phe 


Ala 


Ser 


Leu 


Ser 


His 




85 










90 


Val 


Asp 


Pro 


Glu 


Asn 


Phe 


Arg 


He 




100 










105 


Met 


Met 


Gly His 


His 


Leu 


Gly 


Gin 




115 










120 


Ala 


Ala 


Phe 


Gin 


Lys 


Val 


Val 


Ala 




130 








135 


His 


Lys 
145 


Tyr 


His 







<400> 44 
gcggcggcgc 
gagggccacg 
agcggatccc 
ggatgttgcc 
gcagatttga 
caggatgacc 
gtcagtcttg 
tattcacttg 
tggaatacag 
ttgttggaac 
gttgagggca 
agggccatgt 
gtcgatgctg 
aatgacagca 
ttacaacgtt 
ttgagttttg 
gagcacaagt 
ctgatagata 
gctgctatgc 
cttagccaaa 
ctgccaccca 
ttcaaaccaa 
gaaaatcctt 
aagttatgta 
gatatctaca 
acaagcaaac 
caaaatacct 
tgctgtgaag 
aggaccgtag 
tgtgcttggc 
aacctcatgt 
actagtaggt 
ttgcatattg 
gtaactgtgg 
gctaagaggt 
tttatgcgag 
gaacttgacc 



cagcttcctc 
cccccgacgc 
acacctccgg 
acaaaatcta 
gcacaatgtc 
taggagtgat 
gtactgaatg 
accttccatc 
tcatgtccac 
agttgaaact 
ctgaattccc 
ttatgagtgg 
ccaccttaca 
ctgtagaaca 
gtcgagaata 
ctgatctctt 
tcactgctgt 
ttctcagtag 
tgtggctaga 
tcagaattga 
atgataagtc 
gacttgggat 
gtagtcttta 
gcccaccagc 
tagcaggggg 
ttcagactgc 
ggtttccaaa 
gctatatcta 
aaagatacga 
aatggagtgc 
actgttattt 
cctttgcttc 
ctaccaattc 
aaatttatga 
actctgaccc 
aaacccactt 

ggtggtctct 



ggccggaggg 
cgcgccggag 
accgagggac 
cctcgtgtat 
tgcagactgt 
tctaagtctg 
ctgcagataa 
agtaagacgg 
tcaagacgag 
gttttatgaa 
ttgtcataag 
actaagtgaa 
gataataata 
gctttatgaa 
tttaattaaa 
cagttgtgag 
gtatcatcag 
tgacaattta 
gtataacaca 
tgcactttca 
agtggtggtt 
gactaaagag 
ctcttctgtc 
tgatttgcat 
tcaagttcct 
cttcagaact 
gaccccaatg 
tgcaattgga 
cactgagaaa 
agcagttgtg 
tccaaggtct 
agctgcagct 
cggcataaga 
tgtgaataaa 
ctgtgttaga 
aaatgagcga 
gcggcagcat 



gaggcgagac 
gggcccagtg 
gcggttactc 
ttttctcttt 
gttgaaaaac 
tgtaacaaga 
cagcaagtag 
acttttctaa 
aggcagatca 
cagcagttgt 
atggttcttg 
agcaaacaaa 
acttatgcat 
acagcttgct 
aaaataaatg 
gaattaaaac 
gacgcgttca 
aatgtagaaa 
gaatcacgat 
gaagtaacac 
caaggtctgt 
gaaatgatga 
tgttacagcc 
aaggttggga 
ctgaaaaaca 
gtgaattgct 
ctttttgtcc 
ggagatagcg 
gatgagtgga 
gttcatgact 
gactcatggg 
tttggtgata 
ctcccctctg 
aatgagtgga 
gctgttgtga 
gctaaatacg 
atatctgaac 



cccaggcgag 
tggacggggc 
cacaggatcc 
cactcatgag 
tctgaagaac 
tattactcat 
gttctccttt 
tctgttcctg 
atactgaata 
ttactgacat 
caacatgtag 
cccatgtaca 
acacgggtaa 
tcctacaggt 
cagagaattg 
agagtgctaa 
tgcagctgtc 
aggaagaaac 
cccagtattt 
agagagcttg 
ataagtccat 
ttttcattga 
cccaagcaga 
ccgttgtaac 
caaaaacaaa 
tttattggtt 
gcataaagcc 
taggtggaga 
cgatggtaag 
gcatttatgt 
tagaaatggc 
aaattttcta 
gcactgtaga 
aaatggcagc 
tctcaaattc 
tcacctacca 
gtgtactgtg 



gccgcggcgg 
caccggctgg 
gctgaacata 
ctgcacaatt 
ctaattaaca 
tagtgaatgt 
atttctgaag 
gagatattaa 
tgctgtgtca 
agtgttaatt 
ctcttatttc 
cctgaggaat 
cttggcaatg 
agaagatgtg 
tgtacgattg 
aagaatggtg 
acatgaccta 
cgttcgagaa 
gtcttctgtt 
gtttcaaggt 
gcccaagttt 
agcatcttca 
aaaagtttac 
tcctgataat 
tcacagtaaa 
tgatgcacag 
atctttggtt 
acttaatcgg 
ccctttacct 
gatgacactg 
catgagacag 
tattggaggg 
tgggtcttca 
caacatccct 
tctatgtgtg 
atatgacctg 
ggacttgggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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alafcaccaa Eg£S£ Illicit Stilts « c « t W gtctccatgg 2280 
atggttgcac taccacctat atan^f gggacagaag agtttgaact ggatggagaa 2340 
ctttgtcatt ttctttoe^ 99993 agttca g99 a gtgcacgcct gagctatgtg 2400 

aattctatct tJgatafatt tt*^*? 3 ? gctatgaaag aactaaatat gagtacaEaa 2460 
catatcagtg ag?ctcacaa " ?«««*ta atatttgcat cagtataata 2520 

= = ? ag^cccacag aaagatatgc ttccataata tgaaataqat tattcaataa j^an 

tllllllltt gtacS?tca SK££ t^*^ W-tftZtl acattgS 2640 

t y y gtacagttca aaaagttcat ttataaaagt agtttcctgt tcctagttga 2700 

2701 

<210> 45 

<211> 736 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1455669CB1 

<400> 45 

Itgctgaglc ^acS cgctgggacc cacgacgaca gaaggcgccg atggccgcgc 60 

Sc!t?ct gagc g tg?gc «SSSS« ggaaccaga c agagcctgaa ccgccgcL So 

tctgggtSct ?g?tcc«?c tacctS?^ " g " ggggt ^tgggtaccc cccatgtacc 180 

a2S " 32 S^SS SSKSS SSESSS 

?ccta?cggg g?g?aga"agg 9 ? aacctggt wS?tS agflgalt^ 3fiS 

gaggtcal?? 2£2K£ Iggttcfccc aaacaaatft ^ta^^ 9 aaaggctaca 420 
catacacaaa ca?»?nr a( .» ^astLccccc aaacagattt tgtaattccg aaaatccacg 480 

ggclt^ttcc laaacltar 3 S?£ CCCa £ a ttcct gg* ca gtttatagct accataacc? 540 

asss i ™ ™< ssss jsssss ss 

tccugtgtg aaaaaa gatatgtatt taccttagtg tgttaaataa acactggcat 720 



<210> 46 
<211> 1826 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2084989CB1 

<400> 46 



IcccaStct atlllttlca 1%??*?° ctggtcccgg gcagcggagt tctttaccca 60 
gttcclctcc ggctcglacc acctc?^ "ccgcaaat "agggcttg ggtctggctt 120 
attgccaccc Itcccfaoac !" C tgagccgagc cagctaccgg ggctcctgga 180 

gggacgtglg ccta!a???o 9399 cctccgt 99 a gggacgtcac ggggcagagc 240 

c?cgtgc?cl CC aaS "gcaggcgt g Ctctgtgtg g t gg Ct gggc tctgccaatc 300 
acglgtgacg tcccgcg?ga laaclaaZZ?, agctcctgcc cctccctgct ggtcggcgtc 360 
aaggcggttc talafa^ ™^ B9B ? ggcccg 9 c g9 cgacagcgga ggcagagagg 420 
calculate clgaalcctc SXESS" ggaaagcaaa atgggtgaat tgccttEagl 480 
tttcactgtt IcSatcctc aaaallt^* aagtactttc "gggcagag cccggcactt 540 
gaacatcgto caolaet*™ £*. 3tctgct g«gtccggg gcacagctgg aagcttctcg 600 
gtgSagggcc aaatatat^ !?2 CCggcgt ggtgacccca gggatcaccg aggaccagc? 660 
fcgatggl cgXgfcfg cccaaato^ rZ?*^ 9 gacacagggg agaaggtggt 720 
cacattc?Ic algaagaccc caaccataat of^S"' accatcactg getgeatget 780 
tgccattgtt alltactcca accac?a?aa ?™ 95 39 tgggtgaatc agtccttcaa 840 
gacaacctat accgcagtgg tgacactccc atcactgtga ggcagctggg 900 

cfcclagcac Select tagtcclca'a Tr? 9 ?™ 9 gccctggg ~ tcaaatccl? 960 
caactgcatc aacltccccc llllalaarl a " tgtgc ^ c "tgcagcag tggcagctgc 1020 
tgatgaggca ggtcagaoac tla^tt^t Wagagctg caggtgggca tcccggtggc 1080 
ggtggtgatt llllalattr ° ggtgact g ca gecaagcagg gaatcttcca 1140 

Sganetetg Salaaaaa ISS*?** tcctgccatg gccatcccac cactgatcat 1200 
aEgggcgltg gctgggaaal otaarra^ ggtaggcgac tgtacctctc ttgtcctgga 1260 
teta^gaett gcclgccctt c cctaa^^ ^ccaactctc tctccagcct cgcctgattc 1320 
agcaagaatg alctgjgctg gSotalaaaa " gcaccgcc tcctccacct tegttcatte 1380 
y ccgggccg gggtgaagga actctgeagg ggcaggagga gaggacaaag 1440 
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gaaggaaacc aacttcatca gtattactec a fit-rrrrr- 

catagtcatg cttgatctca tgctctttct gacacacaga aggggactgt 1500 

tacacacggl tgalcacgca clcacaaaca tLar.nn' a 9tgcctcca tactgagagg 1560 

caagctctgg ctcattcaca tgaacaggac acgaaagcaa agcacaggaa 1620 

gcatgttS gggctgagga Stggcacta aacacaa^ 9 "" gagtgc catgcaccag 1680 

aagtttatag tctaatacaa Lr««» aacacaatgg ttatggcccc tgtcctcatg 1740 

agLagtgt! alaftalaal aaaaaa 3 acaaggaggc ^cccacataa atacattctt 1800 



<210> 47 

<211> 1325 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2501034CB1 

<400> 47 



1826 



tgcaSIggg SagggSac SSSSSS SE"'™ 9 Caacggcggg cggagcggcc 60 
cctgccagta caggllcatc tggcagcgac tcctactgct gcctcggcct 120 

accacggtgt gctaafcatf I^!? * f aggaggccaa agagttgtac cgagactata 180 
tcgag?Itga tcet£t£c £c£™?£ aatcctgcca gaaacctgta gacaaatata 240 
gacalattct tttcfatact ™^ 9 5* atgctatatt gtgcaaagct caggcctaca 300 
Sttgtgaagc atacctjaga taa?S ^ catggaaa actctgcata ttttgtttgc 360 
atgacttgH cagltalglf algafataaa ^f?'?'^ caacca 9 aa t actgcccctg 420 
tagaacaaac tgcctatttt at?S? a "tctatag aatgtttgcg attgctgctt 480 
cggcaaaaaa aaagcccaac "StttEe ^ CttCCt gtgggtagaa cggcccatga 540 
gaaaactctt gctgattccl gct§tcat?t ^ ^ 9C a "attatta tctagctacg 600 

aactcattaa Igtltttgtt cttlca^caa att?teSSS Vt?***? gtgtgcctca 660 
tcaaccgtaa gctctccttc tta^^ attttcaggc aattagagtg accctaaaca 720 

tctacttctt Icagagtatg gaf?aaaa?a r? 39 *"^" actgct *> aa agcatcatgg 780 
aggacttctg aaaaattt^f J??;? 9 ?? g ttg 9 aa 9 c g a ttatgccatc tttaaatctc 840 
ailgaaaagl catglaata* aaaloaa^ 3 ctatc ^tgg catgaccagc tgtatctgaa 900 
gattacxaaac tartt^tt aaaccaacct cctcatttct gttgagtaaa atgaagcaaa 960 
Eg£Z£ aagagacfag C " t9gatac cdcclccac 102% 

ccacatggac acaawaata tton^S™ acagatattc acagtcacac atagaaacac 1080 
atacgtffta clcafggccl gg?aaccact E£ 9 ?? tSa Catgcaacag Maaggttt 1140 
ttcattcctt tgacafjtt* «Ee2t?tt ? C a ^ ataataaa acatctaggt 1200 

tttgtgttct tacaataaaa ~ a ^" tt: aatt taaatg ttgttactgg cttaaaatat 1260 
aaaaa tacaata g aa acgcttttaa taaagtcttt cagaataaac caaaaaaaaa 1320 



<210> 48 

<211> 1832 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2745212CB1 

<400> 48 



tllTgtllll 9CgggtCgcg «tgctgt«.g gtctgcgggc 60 

gttglgtggg «aaacaa?f ™ 9 9 ? * ^gcagtgcct gacggcgcgt ctgacgcgga 120 
cgallttlll Itala?aac a 999 ? 9C " ta gtcg 99ggtg gtgggagaag gaggaggcgg 180 
glgtgctgtg ctttca^gg ccfct?^ 9 ? a 'f gaagcC "aattccag gagggtgagS 240 
aggacaaaca agtaaaala? t?r»^«^ atgaagcaaa gtgtgtaaag gttgccataa 300 
gogttccgaa oIaIa aa ^ a f "catacatt aca 9tggttg gaataaaaat tgggatgaat 360 
??caaaaSc clftcfaaaa caa^f 309 tggacaccaa tttgcagaaa cagcgagaac 420 
agaagaca?c tg^ctgca! calaaalata 3 ? gggaagat sagaggggct gccccaggaa 480 
aaacacctqg aaata^L? 5 9 ttgaa 9 t 9 aa aacgaaaaag aacaaacaga 540 

aaagggccc? ggtagftlct Ict«talaa ^ agtgagaC ccctcagcct cctcggaaga 600 
ttaaagtaal glttfctgaa aaSSr atga 9gaaac attcatgaac agagttgaag 660 
ccaggcaaaa acagctc^t S?^?^" cgtggcttgt tgatgactgg gacttaatta 720 
attaHgcaaa ttalaaaaaa trrr^ 9 ccaagaa 9 aa tgtggattcc attcttgagg 780 
aagttgtggl agggatlaaa clatfc?? 33 aca * agataa taaggagtat gcggttaatg 840 
aatttgallg aclfcagtat a cgtaatgtt gggtacccag ctactctata 900 

a gag accacagtat gctgaaattc ttgcagatca tcccgatgca cccatgtccc 960 
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aggtgtatgg 
atacacctct 
agtacctggc 
ctgagtacca 
taaacacatt 
tgtataacaa 
taatagctcc 
tcaatgatgg 
gcttttgaat 
tgctatatga 
gaacacactc 
tcctttgcaa 
aatggtggct 
cgcaatgcat 
atcttcaata 



agcgccacat 
ggatgagaag 
aaagaattct 
tcggaaagct 
tttgttctta 
ttgatgtttg 
ttttttcttc 
acaacagagg 
gctggtggtt 
caaaatgctc 
atccatttgt 
gtcatccatg 
tgtttcatgg 
tgtgtagcta 
aattttttct 



ctcctgagat 
agccttgctt 
gcaactttgt 
gtgtgagagg 
gtctatctct 
ttttctgttt 
tttctttttt 
gatatgctgt 
ctattccttt 
tgattcctag 
gcttttgttt 
ttgttactta 
tttttgtatt 
gttttctgga 
ttaaatttca 



tatttgtacg 
tattactcaa 
tcagtgccag 
cactctcact 
tgtacaaacg 
gattttaaac 
tttttcattt 
agagtgtttt 
gacactacgc 
tgccaaaggt 
ttttttatgg 
ggcattttat 
tgtgtctaat 
aaagtcaatc 
aa 



aattggagca 
ttatcttcac 
cgattatgaa 
cacttatgtt 
atgtgctttg 
agagaaaaaa 
caaaattgct 
attgcctagt 
acttttataa 
tcaattcagt 
tgcttaaagt 
cttggctcaa 
gcacgtttta 
ttttaggaat 



atgttggctt 
gatttcctaa 
gtggctcctc 
tggatctccg 
aagatgttag 
taaaaggggg 
gccagtgttt 
tgacaaagct 
tacatgttaa 
gtatataact 
aaagagccca 
attgttgaag 
acatgataga 
tgtttttcag 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1832 



<210> 49 

<211> 1211 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4833111CB1 



<400> 49 

gacagacccg 

cggcttggat 

agcgctggtg 

taccattaaa 

aatatatgct 

tatcacctat 

tatgaaacat 

atctgaagtg 

ctctaacatc 

tttaagagag 

ctggtcctgg 

tgcaggtgga 

gctggcaaag 

ctggcggtca 

cagtctggga 

agttggcaga 

gggcctgcag 

gaagaccagt 

tggtatgaag 

ggtactctga 

aaaaaaaaaa 



cctcaaacat 
tgtagccttg 
gctggtgggg 
accaggctgc 
ggcgttcctt 
gaatatgtga 
atgttggctg 
gttaagcaga 
tcatatgaag 
attccttttt 
aggcaggatc 
tttgccgctg 
gctggctcca 
caggggctgg 
ggtttcatct 
aagagtcctt 
tgcaaaccct 
tgtgctaaga 
tcattggcct 
acaatttcct 
a 



ggcggcgccc 
acgaggtctg 
tagcaggtgt 
agagtcccca 
ctgctgctat 
agtggttttt 
cctctgctgg 
gggcacaggt 
agggtatcca 
ctttggtcca 
atgtggtgga 
cagtcaccac 
gcactgctga 
caggattatt 
ttctgggggc 
gaagcagaga 
cttccgctga 
taccggcatg 
gtatgccaga 
cagaacctct 



agcgcgcgag 
agcgaccatg 
ttctgttgac 
aggatttagt 
tggatccttt 
gcatgctgat 
agaagtggtt 
atctgcttct 
agggttgtat 
gtttccctta 
ttcttggcag 
ccctctagac 
tgggaatgtg 
tgcaggtgtc 
ttatgaccga 
caagcctcac 
gcagctgtct 
gagattgtgc 
gagctaagag 
taataaataa 



gacgtgatcc 
gaccggccgg 
ttgatattat 
aaggctggtg 
cctaatgctg 
tcatcttcgt 
gcctgcctga 
acaagaacat 
cgaggctata 
tgggagtcct 
tcagcagtct 
gtggcaaaga 
ctctctgtcc 
ttccctcgaa 
acgcacagct 
ctccacttct 
gaactatagg 
catccgtggt 
aagaaaacgg 
gtttggtaat 



gcttctgctc 
ggttcgtggc 
ttcctctgga 
gttttcatgg 
ctgcattttt 
atttgacacc 
ttcgagttcc 
ttcagatttt 
aaagcacagt 
taaaagccct 
gtggagcttt 
caagaattac 
tgcatggggt 
tggcagccat 
tgctgttgga 
gtcaagagag 
ccccagtgct 
ataggctggc 
ggtctgtggc 
gctgagaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1211 



<210> 50 

<211> 1046 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 876677CB1 



<400> 50 

cccacgcgtc 

acatctgagt 

ataaaagctt 

agagaatttc 

cctctctttc 

cattaaggag 

caaggggcca 

ccgaacagtt 

tcaggtagat 

caacgatgtc 



cgggaatgtc 
tgctgcctgt 
ctcaagaatg 
gtgggtgtca 
ctgttggtga 
tatgaacgtg 
ggtttgatcc 
acttgcaaca 
ggagttgtct 
catcaagcaa 



tttcactctc 
tcttcacact 
agatggattc 
acaataaacg 
tcattacctt 
ctgttgtatt 
tggtcctgcc 
ttcctccaca 
attacagaat 
catttctgct 



tcatactttc 
tagctccaaa 
tagggtgtct 
gcttggtgta 
ccccatctcc 
ccgtctggga 
atgcatagat 
agagatcctc 
ctatagtgct 
ggctcaaacc 



tcctctcccc 
cccatgaaaa 
tcacctgaga 
tgtggctgga 
atatggatgt 
cgcatccaag 
gtgtttgtca 
accagagact 
gtctcagcag 
actctgagaa 



tctcccaagc 60 
attgccaagt 120 
agcaagataa 180 
tcctgttttc 240 
gcttgaagat 300 
ctgacaaagc 360 
aagttgacct 420 
ccgtaactac 480 
tggctaatgt 540 
atgtcttagg 600 
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gacacagacc 
tttacttgat 
tgttcggatt 
agcgagagcc 
agcctccatg 
gagcacggta 
agagggcatt 
tcctcttgcg 



ttgtcccaga 
gatgccaccg 
cccgtgcagt 
aaggtccttg 
gtgctggctg 
gccaccgaga 
ggtggcgtca 
gtagtcagct 



tcttagctgg 
aactgtgggg 
tgcagagatc 
cagctgaagg 
agtctcccat 
agaattctac 
gctatgataa 
attgca 



acgagaagag 
gatccgggtg 
catggcagcc 
agaaatgaat 
agctctccag 
gattgtgttt 
ccacaagaag 



atcgcccata 
gcccgagtgg 
gaggctgagg 
gcttccaaat 
ctgcgctacc 
cctctgccca 
cttccaaata 



gcatccagac 660 
aaatcaaaga 720 
ccacccggga 780 
ccctgaagtc 840 
tgcagacctt 900 
tgaatatact 960 
aagcctgagg 1020 
1046 



<210> 51 
<211> 1660 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



2326143CB1 



<400> 51 

gctcccctgc 

cggcgcgggc 

gccgccaccg 

aaaggaaatt 

gcatcttgtt 

tgcatattaa 

cagccacagt 

caagaacatg 

gcagcgttac 

cttgaacaca 

atgtttcacc 

actagcagtt 

gaagccatgg 

ggaggacagc 

ctgcagctag 

gcaacccggc 

accaacatag 

acaaggttga 

gcagaccgca 

tcatcctcag 

gatattatgc 

aatgagcctc 

ctgtatttgc 

caggtctgtc 

taatcatttc 

ggtgatctct 

actgaagagg 

aataaaaact 



ccaccccgcc 

agccgtgacc 

tctgaactag 

ttcgaggtcg 

atgaaagtgg 

caagggtaga 

cttttacttg 

ttacttctga 

ctggaggcga 

gagcccctag 

ctggccgggg 

ctactggtgg 

atcctatagc 

ttaattcctc 

aacggcagca 

gtactaacac 

ctaatacaga 

atgatcctaa 

gcctgtttgt 

atgaggatga 

ctttagatgt 

caccacctcc 

caatgaaagt 

actcttgtta 

cacttaacta 

gcttcctgta 

aaaattgaaa 

gcttggatgg 



cccgtggccg 

cccgcggggg 

gatgtcccga 

cagatataag 

tgcaacaaca 

ttttgattta 

tccctattgt 

acatgcagaa 

tcctaatcat 

agatttagat 

attaggaggt 

actttcttct 

tgagctttta 

tggcccttcc 

tgcccaggca 

aagcagtgtc 

aagcagtcag 

aatgtctgaa 

ccaagagctc 

tcggggggag 

tgctttagaa 

tctttgatga 

ggacaacaac 

cattgtgtac 

atttttactt 

ccttgacatg 

tctctaatga 

tttaccttaa 



agcccgggag 

acactgcagc 

catgaaggtg 

tgtttaattt 

acaaggcata 

tactatggtg 

ggaaaaatgg 

acatcaacag 

gtcacggatg 

gaatcgagtg 

cctcgtgctc 

tctcagagtt 

tctcagttat 

gcttctcagt 

gcacggcaac 

accactacaa 

cagactctac 

acggagcgcc 

cttctgtcca 

atggcagatt 

aacctaaatt 

catcccaatt 

tatcttgggt 

attcaaaagg 

ctagcaggta 

caaaaggctc 

agctgctgtg 

aaaaaaaaaa 



tcgagtggga 

cggagcccgg 

tcagctgtga 

gctacgatta 

caactgacca 

gggaagcttt 

gctatacgga 

aagtgatttg 

actttgcagc 

gtgttcgaca 

gtagatcaaa 

catattctcc 

caggagtgag 

tacaacaact 

aactggagac 

tcacacaatc 

agaattccca 

agtccatgga 

ctttagtgcg 

ttggtgctat 

taaaagagag 

cgcagacaat 

ttgtttggtg 

aagagagaaa 

aatgtaggta 

tcctaatact 

tgtatttatg 



gtcggccggc 

gaggggccgc 

tgcatgttta 

cgatctttgt 

cccaatgcag 

ctctgtagag 

gacatctctt 

tccaatatgt 

tcatcttaca 

tgtacgtaga 

catgcacttt 

aagcaatagg 

acgttctgca 

gcagatgcag 

cgcacgcaac 

cacagcaaca 

gtttctttta 

aagcgagcgt 

tgaagagagc 

gggctgtgta 

taataaagga 

gtcctctgtg 

attgtaattt 

atatatatga 

gcagtgcagg 

ccacattcaa 

aatattaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 



<210> 52 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2786302CB1 



<400> 52 

cctttattag 

cagtttcttc 

ggataaagaa 

tttagatttg 

ttttaaaaaa 

aacctttctt 

tcaaatcttc 

ttgagtgtga 

cttcagagtg 

ataatgaata 



agaaagagtg 
tgtcttctgc 
taactctagg 
gctttaaaag 
attaatgtgg 
ccacatgaga 
aagaaaatga 
atgccatggt 
ttattttcct 
aagacccagt 



acctatctac 
tcctggacat 
acaagagttg 
cccatactat 
gctttttaat 
agtattttac 
tcaaatcttc 
gatgctaatt 
ttttgtcata 
tttcagaatc 



cccttacttg 
ggcatgaaaa 
ctttggggct 
tttattagct 
tctttctcag 
aagttgcttg 
tgtgacaaaa 
ctgattattt 
aggtgtctgg 
tctccacgga 



gggaagactc 
cggaagtttg 
aaacctatct 
cttctatgct 
tgggagggtg 
tcaaatttaa 
aaatggacaa 
ttcttttctt 
aaatgaagta 
gtagagaaac 



tctgcttcaa 60 
agtatttcaa 120 
cctgtattca 180 
ttttaaaagt 240 
agtttgaata 300 
aagaaaatga 3 60 
atattcacca 420 
ttagccatac 480 
tggaaatgaa 540 
tcatcccaat 600 
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ccggaagagc ccgaagaaga agatgaagat 
gcactcactg ctccaaactt ggaggaggaa 
gaatattatg agacaaagaa aagttgcttt 
aatgtttcct tttgtgttaa aaaaggtgaa 
ggtaaaagta cttccattaa aatgataact 
gtgttacaag gcagcagagc atcagtaagg 
gggtactgcc ctcaggagaa ctcactgtgg 
ttgtatgcag ctgtggaaag actggggcaa 
ggtgggaggt cttaagctcc aggaacaact 

<210> 53 
<211> 1120 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3735780CB1 



gttcaagctg aaagagtcca agcagcaaat 660 
ccagtcataa ctgcaagctg tttacacaag 720 
tcaacaagaa agaagaaaat agccatcaga 780 
gttttgggac tactaggaca caatggagct 840 
gggtgcacaa agccaactgc aggagtggtg 900 
caacagcatg acaacagcct caagttcttg 960 
cccaagctta ccatgaaaga gcacttggag 1020 
aaaagatgct gctctcagta tttcacgatt 1080 

1110 



<400> 53 

gaaatccagt 

agaaattggg 

aggttaaaaa 

ctcatgaaaa 

caaatgttgg 

caatcacatt 

gggatgcaat 

ccccattgat 

cctgccttct 

tgcttgcagg 

aagaccacaa 

gtcatggaat 

gcaatgtctt 

ctcacagtgc 

gattcttgtt 

aatttcagtc 

taaatctttt 

tcaatgcaac 

tttatcaact 



tatcaaaatt 
aacattatca 
aagtccttca 
gaggccacca 
tgagatgaag 
tcccattcag 
acttcagttg 
gcagaagaca 
ccacaagcat 
gacaacagaa 
acatcatgac 
tggagagtat 
gtttttcggc 
tcatctggtc 
ttttccaatt 
tttccccaag 
cagaggtgcc 
ttatgagttc 
gaatagacct 



gactcaagaa 
caaagctatc 
tgaaaaagaa 
atactaacat 
cattacttgt 
aaggtcctct 
agaagggatg 
actacgcttg 
gtcagtgctc 
gcaattttca 
aaatttacca 
tatcgaggct 
cttcgaggtc 
aatgatttta 
aatgttgtaa 
gttttccaaa 
catctgaatt 
ttgttaaagg 
tctaagaaga 



gagagaacct 
atcctgccaa 
agatcttaag 
cttcaaaaca 
gtggctgctg 
ttcgacaaca 
gatttcgaaa 
cacttatgtt 
cagagtttgc 
ctccactgga 
acacttacca 
tggtgcccat 
ccattaagga 
tctgtggagg 
aaactcgcat 
aaatctggct 
accatcggtc 
ttatatgaaa 
aaaaaaaaaa 



aacagaacaa 
actccaggct 
cagcatgatg 
agatatatca 
tgcagccttc 
gctgtatggc 
tttgtatcgt 
tggtctgtat 
aaccagtggc 
aagagttcag 
ggctttcaag 
tcttttccgg 
gcatctgcct 
tctattgggt 
acagtctcag 
ggaacgggac 
cctcatctct 
aaaccatcag 



taacaatgga 
cagatgtcac 
gattcagaag 
cctcatatta 
aacaatgtcg 
atcaaaaccc 
ggaatccttc 
gaggatttat 
gtggcggcag 
acattgcttc 
gcactgaaat 
aatggactca 
accgcaacga 
gccatgttgg 
attggtgggg 
agaaaactga 
tggggcataa 
ttaagtgcca 



<210> 54 

<211> 886 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



039026CB1 



<400> 54 

ggccgcggct 

ccgccagcct 

attggagaca 

gcatcagaaa 

ctgatcagat 

gctttttgga 

gttcagaaca 

aggaatatca 

cacgatagag 

tggaaatgag 

ctgtcatgac 

ataatcacaa 

ccttatgatt 

tgactgttaa 

ttactcaaag 

<210> 55 
<211> 2336 



cctgtccaga 
tgtactgatg 
ttcaagcaaa 
aggtgactaa 
aaaacagttt 
ctgtgttaaa 
ttgcttacag 
tattcagcag 
aagtcctgat 
gactcatctg 
tgtttggcaa 
tagaaggtct 
cagcagcttg 
ttttaa'agca 
taaaataaat 



ccctgaccct 
tcggatgcga 
ggttggacaa 
taaaggtacc 
aaggaatttc 
gacttcacaa 
aaatatttaa 
aatgaagccc 
ggatgaactt 
atagaatccc 
atggaaaccg 
tattgttcag 
gtcacttgat 
acttatgtgt 
ggaaatatca 



ccctcccaag 
gagcctgtgc 
ctacttttcc 
agaagaatat 
tggggaccta 
caagagaagt 
aaatgacaca 
tggcagccaa 
ttgatgaaag 
ctgaaagcag 
ctggagaaac 
tgaaataata 
tagaaaaata 
tcgatcatgt 
ctgaaaaaaa 



gctcaaccgt 
ttaagtaaga 
agaacagaaa 
ggctgcacaa 
caataaactt 
aaaacctgaa 
aagaatatcc 
agcaggactc 
attgccaaca 
tagccaccat 
aaaattgcta 
agatgcaaca 
aaccattgtt 
atgagataga 
aaaaaa 



cccccaacaa 
atcaggcctt 
ggaaactcat 
ataccagaat 
acagagacct 
gagaccacct 
atgagatttc 
cttggccaac 
gctgctttat 
gttcaaccat 
tttaccagga 
tttgttgagg 
tcttcaattg 
aaaattttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1120 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

886 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 260607CB1 



<400> 55 

t?taaalr^f I!?™ 999 * atttg 9 cccc cgagcagtaa ttcggcacga ggaccatctc 60 

r2£ 5*?" tctttgaaac acataaccaa aatggtttga ttcactgact 120 
attaccftt? Zllnttr 9C "9 ttacacc ccaaatggct ttaatcccct ctcgggtctg 180 
fcclttttta lilt 999 tt 9 t 99 accc agctcctgtg aggggtctgg ttaggagaga 240 
«ttate« a a ?? a ? a ?" a 9"" a «gc ccttttctac tttcctcccc tcctcccagt 300 
"""f aa 5 ctt 5 ttccct ttttcctgac cccctccttc tggaggcagt tgggagctat 360 
"3"r ^"f actat twcagaaaa gaccccattt aaaacccaga gaacaccgga 420 
™™ 9 ctagttggtt ctgtgtccat tttcctctgt gccaaagaca gacagacaga 480 
Wtalaoaaa I^of l eC t gaatcaaagc aatagccagc tttcgacaca lacc?ggc?g 540 
Itlltmin aaggcct = ct ggaaactggg agctaagggc gaggcccttc ccttcagagg 600 
tctatcaoc? e£22? t9t ggtgtttgcc aagccaaggg gtagggagcc gagaaattgg 660 
cctatta?™ -fH 9 ^ 9 " ctttggggaa ggagaggaag tttggggctc caggtagctc 720 
t"Iaacc?a S " 9 tc «ctgccc ctactgcaga gatagcactg ccfagtEccc 780 
"f a9 ?^"? gcagac 99gc agcgaggagg ggcctcagtt agctctcaag ggtgccttcc 840 
"c=„"Sr a = cca gacat accctctgcc aaactgggaa ccagcagtgc tagtaactac 900 
9 9 cccagagggc ctgcttgagc cttcttgctc cacaggagaa gctggtgcct 960 
Iao?at aa af " ttCOt " C acctctca tc aggggtgggg gtteScttt ctt?l«ctg 1020 
a tfr?™» ggggagatcc tagtggcttt gccattcaaa ccactcgact gtttgcctgc 1080 
aaaaal 3333 SSlE* 98 ? 9 ggaaa <=agca cagcctgtca cagtaattgc aggaagattg 1140 
aa?tt a a a ^ ™ 3 5 9 cca 9399aca taaaagccat ttcccttcca aacactcgac 1200 
tattt 3 ?ttc ^? aa ^ a " t ctct 9tattc agacttagag taacaccagc tgaaaactgc 1260 
tcatatctaa S» ataaggcttc tctatcgggg tacgggacag ggaggaggcc 1320 
„et?t«™» a f?= 9 ? a B Ct aggggcgaga gccgcagccc Cgaccctcgg tcctgtgcac 1380 
aaacaaffal ™ fftC ^?f Cg 9 c 9«ttt gctggcgcct tagtatggtt gactccggat 1440 
«« a t a aa a t^™^ tttttcttga atgaaatagc aggaagctcc tcgggagcat 1500 
2tK™»?« ^ aa ^ C9ta9t gatsgatgct acgagtataa atggattaac tacctcaatc 1560 
oaa a a a Ir 3a ga t tggaact aagggcaggg actcatgcat aagggtatga atcccagcca 1620 
^ a ™ a r^ a gttgaggctt gtgccacaaa aggtttgtcc ttggggaaca ggcaggcctg 1680 
ctt?!^^ ccccatatcg attgggctgg gagggctggc cgtgaggtcc ccactttctg 1740 
""?a^™ ccatgtgtca cccctttggc ctccagcttg tccctctctc actttctata 1800 
t^Hftt" ac = agatg gt gaggaaagga atggcctctc cccttctaga gggggctggc 1860 
tggagtgaga cctggggctt ggcctggaac ccaccacaca gccccaaagt caggaagcct 1920 
gagctgagac ctcttcaaca gggtttcttt gagatcctlc acltcclttg llio 
"?5 CttCaa c 99gggccca gttggctcta gaaggagaag aggtgaagca 2040 
2?™??™ ccctggggga gtctgagggc gcggtccttg gactcattca ggccgtcitt 2100 
?^ a9 "? 999 ga gttccact gggcgatccc agcccctccc cacccaccct ccaatggacc 2160 
attaoor^ 2!!: CCC ? tCt cacttctg " ttatccacct cttagcaaaa caatagataa 2220 
a o« a ?f= a != g ^ a 9 cccca cttgcttagg ttaggggggg aaaaagattt ctttttccaa 2280 
aggaaaaaaa tattaccttg agaatacttt ccaaaaaaca aaatttaaaa aaaaaa 2336 



<210> 56 
<211> 2200 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1429651CB1 

<400> 56 



S"" a cgcctggatc tcatccaact gacacagaaa ccctgtaagg atccagaggt 60 
aaafoco 3 ?? a " atggaga gcggcaccag cagccctcag cctccacagt tagatcccct 



120 



rT _ =1 ____^_i ----- —a v.cyv.v.v.i.i.ciu LtLtcacagi: cagaucccct x^u 

catenate cESSEE SS"S!2fS vacate gcggtgctag ttctgtactt 180 



240 



' ctggccgttg gactatggtc cacagtgaag accaaaagag acacagtgaa 

c?™^rr " gg " ggag gggacatggt gtggtggcca gtgggtgcat ccttgtttgc 300 

r?— 3 ?*?" ggaagtggac atttcattgg cctggcaggg tcaggtgctg etaegggcat 360 

cctal™ 3 ^ gaaC ttaatggc " gttttctgtg ctgatgttgg cctggacctt 420 

" ' tacactgetg gtcaggtcac cacgatgcca gaatacctac ggaagegett 480 

™ 9 ?^™ a ? aatcccca tcatcctggc tgtactctac ctatttatct acatcttcac 540 

caagatctcg gtagacatgt atgcaggtgc catcttcatc cagcagtctt tgcacctgga 600 
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tctgtacctg 

cctggccgct 

caccttgatg 

cttcttggcc 

tgccttccat 

tggaatgtcc 

tctggctgcc 

ggcgctgccc 

tcaagtggcc 

ttcggacatc 

gatgatggct 

cagcaccatc 

gctcatgatt 

ccctgtggtc 

ctacctgcag 

tgaaaagggt 

cctggacttt 

gaagagcatt 

ctccaccgtg 

gtttactcgt 

tcttaccctc 

gatggttcaa 

agtggtgaag 

ggccagagca 

ggacgtcaac 

gtgtggggtg 

aaagaaaaaa 



gccatagttg 

gtgatctaca 

ggctacagtt 

ctggctagca 

attttccgag 

atcccatccc 

aagaacctgt 

ctcttcataa 

tgtgcagatc 

gcgtatccca 

gtgatggtgg 

ttcaccatgg 

gtgggcaggg 

caggccagcc 

ccgcctgtgg 

gccttctggg 

atttacgtgc 

cactacctct 

agctggttca 

cacgaccccg 

tctcagaacg 

gaaaacacgt 

gccatcctgt 

gaagccatca 

ctcattttct 

aacccagggg 

taataaagct 



ggctactggc 
cggatgccct 
tcgccgcggt 
accggagtga 
atccgctgac 
tctggtactg 
cccatgccaa 
tggtgttccc 
cagagatctg 
aactcgtgct 
cggctctcat 
acctctggaa 
tgtttgtgct 
agggcggcca 
cggtggtctt 
gcctgatctc 
agcctcgatg 
acttctccat 
cagagccacc 
tggtccagaa 
ggatgccaga 
ctaaaaccca 
ggctctgtgg 
tagtttccct 
gcgtgagccg 
tccaaactct 
tttgtttacc 



catcacngct 
gcagacgctg 
tggtgggatg 
gaacagcagc 
atctgatctc 
gtgcacggat 
aggaggtgct 
tgggatggtc 
ccagaagatc 
ggaactcctg 
gtcctccctc 
tcacctccgg 
gctgctggtc 
gctcttcatc 
catcatggga 
gggcctgctc 
cgaccagcca 
gatcctgtcc 
ctccaaggag 
ggaacaagca 
ggccagcagc 
cagctgtgac 
aatacaggag 
ggaagaaaac 
cgccaccttt 
gtttctcttc 
aaaaaaaaaa 



gtatacacgg 
atcatgctta 
gaaggactga 
tgcgggctgc 
ccgtggccgg 
caggtgattg 
ctgatggctg 
agccgcatcc 
tgcagcaacc 
cccacagggc 
acctccatct 
cctcgggcat 
ctggtctcca 
tatatccagt 
tgtttctgga 
ctgggcttgg 
gatgagcgcc 
acggtcaccc 
atggtcagcc 
ccaccagcag 
agcagcagcg 
atgaccccaa 
aagggcaagg 
cccttggtga 
atctggggct 
agtgctccat 



ttgctggtgg 
taggagcgct 
aggagaagta 
cccgggaaga 
gggtcctatt 
tccagcggac 
catacctgaa 
tcttcccaga 
cctcaggctg 
tccgtgggct 
ttaacagtgc 
ctgagaagga 
tcctctggat 
ccatcagctc 
agaggaccaa 
ttaggctggt 
cggtcctggt 
tcatcactgt 
acctgacctg 
ctcccttgtc 
tccagttcga 
agcagtccaa 
aagagctccc 
agaccctcct 
attttgctta 
ttttttaatg 



€60 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2200 



<210> 57 

<211> 2823 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



2069971CB1 



<400> 57 

gaaagacata 

taggtgaaaa 

caacaacact 

tagaaagtgt 

caacaacgac 

tatcttgtga 

aaatgaacac 

cccagagaga 

aagcccacag 

tcagcaatat 

cttgctgttg 

ctcttctccc 

cgagcagaga 

ctagtgtacc 

atgatcatcg 

cagagatgcc 

tttgtgcatg 

atctacttta 

ctggtcttgg 

gtgccaaaga 

ctgtctgtgc 

catgttgcta 

cttagcaaaa 

gaccccgact 

ccaaatttga 

aggaaccaaa 

tccattcacc 

cagtagctgt 

aacagctgtt 

agaacacata 



cacacttcat 

caatgaagcc 

gatgtaggaa 

ataaataatt 

aacaacagcc 

atgataaagc 

ttcatagcaa 

gaccagagga 

aaaagggggc 

gcttcatttt 

tcacagatgc 

tgtggttgtc 

tccttggtgc 

tggcatgtga 

tttccagctg 

ttggccacaa 

cccttggaga 

agccagagta 

ccagcaccat 

gcctgaatta 

acagcctgca 

cagcagccag 

gctttacgat 

gccttttctg 

caggccacct 

ggaagaaatt 

atgaaggaag 

gctcaattgc 

tgtaactatc 

ggtaaatttg 



gtaatgctac 

aggtaatatt 

gctcattatt 

gcagtgctgc 

gcagctcatc 

tgccaagatg 

tgtggaactc 

gctggagtca 

gaatgagtac 

catgattgca 

tgcccacctc 

atcgaagcct 

cctgctctcc 

gcgcctgctg 

cgcagtggcg 

tcacaaggaa 

tctatttcag 

taaaatagcc 

cactatctta 

cagtggtgtg 

catctggtct 

ccgggacagc 

gcactcactc 

tgaagacccc 

tcaaacatgc 

catgtcatgg 

aggcactgag 

aggaatgtgt 

ggcaatacca 

aactcaggaa 



ctgcaagtct 

gcaaggaggc 

ttaatttctg 

tttgcttcca 

ctggccgtca 

tatgctttca 

caacagaaac 

ggaggcatgt 

gcctatgcca 

gaggtcgtgg 

ttaattgacc 

ccctctaagc 

atcctgtgca 

tatcctgatt 

gccaacattg 

gtacaagcca 

agtatcagtg 

gacccaatct 

aaggacttct 

aaagagctta 

ctaacaatga 

caagtggttc 

accattcaga 

tgtgactagc 

tgctatgcag 

tgcaatgcac 

atccatcaat 

atatagatta 

aattcatctc 

agtcttacta 



ccctagaaaa 

tgtaatttta 

gagcctttta 

aaactgggca 

tggagtttct 

cactagaaag 

cggtgaataa 

accactgcca 

agtggaaact 

gtgggcacat 

tgaccagttt 

ggctgacatt 

tctgggtggt 

accagatcca 

tactaactgt 

atgccagcgt 

tgctaattag 

gcacactcat 

ccatcttact 

ttttagcagt 

atcaagtaat 

ggagagaaat 

tggaatctcc 

tcagtcacac 

tttctgcatc 

attttatcta 

caattggatt 

ttcctgagtg 

ccttccaata 

gaaatcagtg 



gcagtttttg 60 
gcagacctac 120 
attttttctt 180 
gtgagttcaa 240 
tgaaagaacg 300 
aaggagctgc 3 60 
agatcagtgt 420 
cagtggctcc 480 
ctgttctgct 540 
tgctgggagt 600 
cctgctcagt 660 
tggatggcac 720 
gactggcgtg 780 
ggcgactgtg 840 
ggttttgcac 900 
cagagctgct 960 
tgcacttatt 1020 
cttttccatc 1080 
catggaaggt 1140 
cgacggggtg 1200 
tctctcagct 12 60 
tgctaaagcc 1320 
agttgaccag 1380 
cgtcagtttc 1440 
atagaaaata 1500 
tttatttagt 1560 
atatactgat 1620 
gagccgaagt 1680 
atgcatcttg 1740 
gaagggacaa 1800 
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taalaatalc tctacat^r aaacatca9a aacaaaaaat aaggagagcc aagtcaggaa 1860 
gtaatf^tt ttt«^?^ aac ^cacac tagaacttgg ttctctcacc aagctgtaat 1920 
Iclaattttc a«ta«^ gaattggaaa tatgtatgaa tatacagaga agtgcEtaca 1980 
cltgtttact tcaaaae^r f acattttgg "ataaatcc ctcttatttc caaattctaa 2040 
aatfcttaaa c atataatcac tgctcaaaag gaaacatttt cacctaccag 2100 

alcfgccgac aalallattc llnr?***** Cgtggtt S ta gagacccaga agtcttcaag 2160 
atttfcrtac 9 a 3ttgaccc caccaagttg ttgccacaga caatttagat 2220 

gtgaacac?a c^ta? 9 ,^ aaagca * ac g caaccaattc attcagtcca cgagcatgat 2280 
? ct "gtgcta gacattgggc ttagcattga aactataaag aggaaccaga 2340 
agagtgclg gccaaca"t?e " tgg " gca a «caacact atctgtggag aglalacS llol 
tllclallll ?ttaaaao i£ aaatCct ac 9tcactgg gtttggtttg gaaacctgga 2460 
ttgtclacca cSl, fcccagagat gcctctaaag atgagccata gtctagaaga 2520 
aSatcataa lllltTZnt "gagtggga cagctagata catacattgg cagctacaat 2580 
tgagcatroc S^? ^gtagtgggg tataaaagga aagcgatgga tatcgccgga 2640 
cHItSlaa ctct™« tcacgtcatt Qaggtgacag ctctgctgga ctttgaatta 2700 
a a « a =^f a H g ctctcca 99 a agacgaagaa gagaaggaca ttctaggcaa aaagaagact 2760 
aggcacaagg cacacttatg tttgtctgtt agcttttagt tgaaaaagca agafaclggg 5820 



2823 



<210> 58 

<211> 1491 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2329339CB1 

<400> 58 



aggltcttat S'f g tgcggcctcg gctggcggcg gcaccaggcc acagttgtaa 60 
«gcacaacc ct£££2? atggcagagg agcaggagtt cacccagctc tgcaagStgc 120 
ZllrtZtJl^ ctcacaccca cactgcgtga acaacaccta ccgcagcgca cagcactccc 180 
tStggtgaa aaacIfS " ggctctcc 9^acagcgg aaLddtc gafgttgtgc 240 
llltllllaa lllattt^r atcgag9CCC atcgcatcct gctggctgcg tcctgcgatt 300 
acggtgtgll ctalaataet f? 5993 "^ aggagatgga acaggaagag gtcctgltcc 360 
aglfcfalct atgtgccaaa tcctacattt catatacacc tccgagctgg 420 

afattalcca ^ a ? a9aCaC ^^tgc ctgccagctt cagatcccag 480 

a a ctc a acc a ll^lf. 9 Satttcctca tgtcctgggc ggacgaagag aacattctcg 540 
ttltcctcal llllfr^ ctgttt9act tgagccgcct gactgagcaa ctggacaccE 600 
a o a »^^ Sa aaacttt 9 t 9 gccttctctc ggactgacaa gtaccgccag cttccattgg 660 
Italia^ "^ C 5 CCtC a 9cagcaatc gcctggaggt ctcctgcgag accgaggcl? 720 
ttlllalatc c^« tCtaC cattata9cc tggagcaggt gcaggctgal cagltl?cgc 780 
cgcaololct a« aaa «» C c " gagaca 9 tgcggtttcc gctgatggaa gcEgaggtcc 840 
t«tgtllca c™ 3 ™ 9 Ctggacccca gccctttgag ggacacagtg gccagcggcc 900 
ggtcooactt Zctnrn^? agcctaca 9 c "agcccgca gagcccgcaa acggagctgc 960 
?Iaacaacca a»™^ g * gggcttc 9 ggggcattca ctccacgccg tccactgtcc 1020 
ccclglcccc ™ a f^ a ctaaacccct tactgggaga gtggaagcac ttcactgcct 1080 
ttgglaooal e2E"?£ C aacca ggg« tcgcggtgct caacaacttc gtatacttga 1140 
calaalflla „ " 9 C caaggatttc gagcagagtc ccgatgctgg aggtatgacc 1200 
t§t«Qttot fa^^^ tC cagatcca9t ccctgcagca ggagcacgcc gacctgiccg 1260 
algltgtaaa a??^?™ ° atctacgct9 tggcgggccg tgactaccac aatgacctga 1320 
gglaglt^a llttl^t Cctgccacca actcctgggc atacgtggcc ccactcaaga 1380 
gllagltlat ggcgcgacgc tggaggggaa gatgtatatc acctgcggac 1440 

gcaagcttat tccctttagt gagggttaat tttagcttgc actggccgtc g 1491 

<210> 59 

<211> 986 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2540219CB1 

<400> 59 

Sgccagtac act^^f a 5tccagggc agtggctcct gcggacagcc agcataccag 60 

ggacS anlZZ aC aaccact gtg aggaacgagg gcctgtgccg tgtggctggc 120 

gggcggcatg gcaggagcaa tttcttgggg gacagcgact cctatggatg tcgtgaaaag 180 
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tcgactccaa 
ccagagttac 

gcggggcttc 

ccgcggggac 
agtgttcccc 
ttggacctcc 
tggcccttct 
atccccaatg 
agctgagaag 
tcactcctct 
agtctcaagt 
ccgctcctgc 
tccagtgcgt 
tcaggtctgc 



gctgatgggg 
cagaaggaag 
cccatgagtg 
cacgcagtga 
tgggcctcat 
aagtggacat 
gatgcctggg 
cccactctgc 
tcctgaccat 
gtctgaggat 
aacacgtccc 
acccgtggat 
gctcctcact 
agcaaataaa 



tttatttaaa 
gtcttaaagt 
cggccatgtt 
cgagcccata 
ctctgcatgt 
caattagcaa 
atgcctcatg 
taggctggca 
gagcaccagg 
ggggaggggc 
cgtgcctcca 
cctgaggaca 
gtgcacttat 
tgaaag 



caaatataaa 
gtttttcaga 
ccttgggtac 
agcgccagga 
gaagccctga 
gcgtgggcta 
agtcactgat 
tcaaagagct 
gagccagaaa 
cagtgagctc 
gtctcctctc 
gcggtagcgc 
tagtgtctgt 



ggtgtcctgg 
ggcatcactg 
gagctgtcgc 
ggtgaacaca 
gagctgcaga 
ggatggtgca 
tcaagccctc 
ttccaagaaa 
ccacccagag 
tgggctcagc 
agcaccgacc 
cttcctcacc 
tgagtgatca 



actgtatctc 240 
tgaacgcggt 300 
tgcaggccat 360 
ggatgactac 420 
tgtttgccct 480 
gacactgacg 540 
caaggttctg 600 
tgtttggtcc 660 
aaacgttgct 720 
cactccctcc 780 
aggtttttcc 840 
gcacgctgag 900 
aatcacatcc 960 
986 



<210> 60 
<211> 4023 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2722462CB1 



<400> 60 

gtccggggcg 

gctctggggg 

aggcagcaat 

atgccaaggc 

tggccacggc 

tccggcttgg 

ttgaactctg 

ccatggtcaa 

ggcagatgtc 

ggtattgcgg 

tggacatcct 

tgttctcccc 

tgtttgtcct 

acccctggtc 

acctgcttct 

ctttgctact 

ggaggcctgc 

ggatgagtgc 

tgctcagcct 

agaaaaacag 

gtattaatat 

tcaacggggt 

agcacgtgcc 

aggttgccgg 

ccgtgacagt 

atttcatcgt 

accgcaacgt 

ccctgctcat 

tcttccccaa 

ccgtgggcaa 

tcctcaacag 

tggaaggcgt 

ttgtgaccat 

gaatgaacta 

aggaaatctc 

cctcctcgga 

attccatggt 

accaggagga 

aactcagcag 

agaccgttac 

tactctactg 

tttatataca 

tgtcttcctt 



gcgcgtatgg 

tgagagttcc 

ctacctccag 

gctggcggcc 

cctgctgctg 

catctatgtc 

catgaagtta 

gacctcggtg 

ccatgtgcgg 

tggcgtccgg 

cctgctgctg 

taacccttca 

tctgaccaca 

ctgcgtcttc 

ggctgtggtg 

gcacaagcga 

cggcatctcc 

cagggagcgc 

aaaggacttt 

agagcactgg 

ccttgtgaag 

ctggatcctc 

ctggagttac 

cctgggccct 

gttcgccttc 

ggtcctgcgc 

gctggacacc 

cttttactac 

ctgctgcaac 

caggaccgtg 

ctttgtgacc 

cacctctcag 

ggtggtgatg 

cagccgcaag 

caaagaagag 

tgtcaccagg 

gtttctggga 

gatccaggag 

cagtgcagcc 

ctagcccagc 

cgatgtacgg 

ggaagaaaaa 

aactccagaa 



cggcactatg 

ccctccagcc 

gaaggcgaga 

tacctctttg 

ctgctgctct 

cacgccaccc 

cgctggctgg 

ctggtggtgc 

gtgacccgag 

cgcaacctgc 

ctgttcttca 

gacccctact 

gccaatttcc 

ttcatcgtgt 

ttcgacacct 

accgctatcc 

tacaggcagt 

tatcttacct 

tacgatatct 

tttgatgagc 

tccaaggcct 

gtggagacat 

ctcgtctttc 

gtggagtact 

ctgggactgc 

cccctccagc 

atgttcgagc 

tccttcgcca 

acgagtacag 

gtggaggaag 

ctgtttgagc 

acctcccact 

acgatcattg 

aaccaggact 

ctggttgccg 

ctgctggaga 

cggcgatcaa 

tggtatgagg 

cccgccgccc 

gcccgaaagc 

aactgcggtg 

agacagacaa 

gccagtttgg 



ccatccacga 

ccgcacacaa 

acaacgacaa 

cacacaatca 

ccctgtgcga 

tggagctgtt 

gcctccacac 

agtttgtcga 

cactgcgctg 

ggcagatctt 

tgatcatctt 

tcagcaccct 

cagatgtgat 

acctctccat 

tcaatgacat 

agcatgccta 

ttgaaggcct 

tcaaggccct 

acgaagttgc 

ttcccaggac 

tccagtattt 

ttatgctgaa 

taactatcta 

tgtcttccgg 

tggcgctggc 

tgctgaggtt 

tgctgccccg 

tcgtgggcat 

tggcagatgc 

gctactatta 

tcacagttgt 

ggagccgcct 

tcgcctttat 

cggaagttga 

tcctggagct 

ccctctccca 

ggaccaagag 

agcatgccag 

agcagccccc 

cgtctcttct 

tgtgtacaca 

gatggggctt 

tgaggggtgg 



ctcccaggcc 

ctgggagatg 

gttcttcacc 

cctcttctac 

ggcccccgcc 

tgccctgatg 

cttcatccgg 

ggccatcgtg 

cattttcctg 

ccagtccctg 

tgccatcctc 

ggagaacagc 

gatgccctcc 

cgagctgtat 

tgagaaacgc 

ccgcctgctc 

catgcgcttc 

gaatcagaac 

tgctttgaag 

ggcgctcctc 

catgtacttg 

aggtgggaac 

tggggtggag 

atggaacttg 

cctcaacatg 

gtttaagttg 

gatggccagc 

ggagttcttc 

ctaccgctgg 

tctcaataat 

caacaactgg 

ctacttcatg 

cctcgaggcc 

tggtggcatc 

ctaccgggag 

gatggagaga 

cgacctgagc 

ggagcaagag 

aggcagccgc 

atgcaataac 

tactcacgta 

ggtttataac 

gggtgcggcc 



cccagtctca 

aattaccaag 

caccccaagg 

ctgatggagc 

gtccccgcac 

gtggtagtgt 

cacaagcgga 

gtgttggtac 

gtggactgtc 

ccgcccttca 

ggtttctact 

atcgtcagtc 

tactcccgga 

ttcatcatga 

aagttcaagt 

atcagccaga 

tacaagcccc 

aacacacccc 

tggaaggcca 

atcttcaaag 

gtggtggcag 

ttcttctcca 

ctgttcctga 

tttgacttct 

gagcccttct 

aaggagcgct 

ctgggcctca 

tgcgggatcg 

cgcaaccaca 

tttgacaaca 

tacatcatca 

accttttaca 

ttcgtcttcc 

acccttgaga 

gcacgggggg 

taccagcaac 

ctgaagatgt 

cagcagcgac 

cagcgctccc 

acaatagtat 

tatgcacata 

caccttgccc 

accaggtctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 



45/60 



WO 00/78953 



PCT/USOO/16668 



agctcttcct actgtggaag 
ccagtcgact gcggggcttg 
tgtcactgct actgcttcag 
ccctgcggcc catgccacag 
gaaagaactc ccggtctcca 
tcttcgggga ccagtgccca 
ggccagagct cagttagcca 
txggctctgc aagcctccca 
tccacagtgg cctccttggg 
ggccgatgtc tgcaacccag 
ttgcgattta cccgttcaag 
gataccaggt ccttcacacg 
ccctgtggga cgcagcagcc 
tgctcacacc agagatttcc 
ctcagttaac gtcgccaccc 
tctgagcgtg gtggaggtag 
tcagtgcaga ggtcgctttg 
ttctgcagca ggagccgcag 
gggcattcct ggccagcccc 
tggacagagc ctccctctgg 
ccgaccccag gggagcagta 
aagctgggtc ccatagccag 
gagttggaca agtagcaggt 
tattgagact cactgtgatt 

ggg 

<210> 61 
<211> 2345 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc — feature 
<223> Incyte ID No: 2739264CB1 

<400> 61 

gaaaggaggc ggaactcggt gatctgactg gcggtttccc ggccggactg agaaggggag 60 
cgcgctgcgc gtcgcaggag taacctactt ggtctcctgc tttcgcgaca tggccttcaa 120 
ttttggggct ccctcgggca cctccggtac cgctgcagcc accgcggccc ccgcgggtgg 180 
gtttggagga tttgggacaa catctacaac tgcaggttct gcattcagct tttctgcccc 240 
aactaacaca ggcactactg gactctttgg tggtactcag aacaaaggtt ttggatttgg 300 
tactggtttt ggcacaacaa cgggaactag tactggttta ggtactggtt tgggaactgg 360 
actgggattt ggaggattta atacacagca gcagcagcaa actacattag gtggtctctt 420 
cagtcagcct acacaagctc ctacccagtc caaccagctg ataaatactg cgagtgctct 480 
ttctgctcca acgctgttgg gagatgagag agatgctatt ttggcaaaat ggaatcaact 540 
gcaggccttt tggggaacag gaaaagggta tttcaacaat aatattccgc cagtggaact 600 
cacacaagaa aatccctttt gccgatttaa ggcagtaggt tatagttgca tgcccagtaa 660 
taaagatgaa gatgggctag tggttttagt tttcaacaaa aaagaaacag agattcgaag 720 
ccaacaacaa cagttggtag aatcattgca taaagttttg ggaggaaacc agacccttac 780 
tgtaaatgta gagggcacta aaacattgcc agatgatcag acagaagttg ttatttatgt 840 
tgttgagcgt tcgccaaatg gtacttcaag aagagttcca gctacaacgc tatatgccca 900 
ttttgaacaa gccaatataa aaacacaatt gcagcaactt ggtgtaaccc tttctatgac 960 
tagaacagaa ctttctcctg cacagatcaa acagctttta cagaatcctc ctgctggtgt 1020 
tgatcctatt atctgggaac aggccaaggt agataaccct gattctgaaa agttaattcc 1080 
tgtaccaatg gtgggtttta aggaacttct ccgaagaccg aaggttcaag atcagatgac 1140 
taagcagcat caaaccagat tagatatcat atctgaagat attagtgagc tacaaaagaa 1200 
tcaaactaca tctgtagcca aaattgcaca atacaagagg aaactcatgg atctttccca 1260 
tagaacttta caggtcctaa tcaaacagga aattcaaagg aagagtggtt atgccattca 1320 
ggctgatgaa gagcagttgc gagttcagct ggatacgatt cagggtgaac taaatgcacc 1380 
tactcagttc aagggccgac taaatgaatt gatgtctcaa atcaggatgc agaatcattt 1440 
tggagcagtc agatctgaag aaaggtatta catagatgca gatctgttac gagaaaccaa 1500 
gcagcatttg aaacaacaac aggaaggcct tagccatttg attagcatca ttaaagacga 1560 
tctagaagat ataaagctgg tcgaacatgg attgaatgaa accatccaca tcagaggtgg 1620 
tgtctttagt tgacagttca caaacttgtg taaaggtttg tgaaatgcat cttcttactg 1680 
catcagacct tccttaagaa tgaaaccgac cacatggagg gaaaaagaaa acaattcttt 1740 
cttggattgg ttttttgaga agtttactga caaattactg ttcatcaaat ctgaaacagt 1800 
cacctcacag ctcttcaaag aaaacctttg aaagatttat atctaaaagc tgtatttact 1860 
ttaaaagaag tgcataatta ccaaaattgt atgtactatt gtacattttt acaacagcac 1920 



gctccagaag gcccttcaca 
cccctcacgt gggctggcct 
gctcacaccc ccccgacctg 
gtttctgtgt tttgctttag 
gggtggtatt tcagtgtctg 
ggatctaatg gaagcggaat 
gtgccgggcg gccacagatt 
cccagcctitc tctggcttaa 
gacccagaac ccggaggaag 
accacttcgt ggaatgggct 
caaaacaacg ttttggttaa 
tgtgcactag gaacaggagc 
ccgtggcccc ggcccagttc 
acagcaggag cggttggtgc 
ctcctcccac catggtaccc 
gagggttgcc agctgcagtg 
gttccgcttc cctgggccac 
tggcaggatg gagggtgcga 
ggccctctgg tgcctgcttc 
ccttgtcctc ttcccagcca 
cctggtcccc caccccctcc 
cacggcatgg ttctcccctt 
gtttgttttt aaagcacagc 
cccccgggag tcagactggc 



aggagacccc tcacctggat 2640 
ccatcggcca cgtccaaagc 2700 
atggcgtgcc cgccccctct 2760 
ggacagaacc acttaggaaa 2820 
tgataatgtc acgcaacacc 2880 
tggggcaact gggcccgtgt 2940 
acactgacca atctcctccc 3000 
cccttgttgg cgaaaactct 3060 
gggcatgagg caggaagtgg 3120 
cttgaccaaa tccctttttt 3180 
ctaaggattg tgctaaagcc 3240 
gaacagcaca gagagacgct 3300 
ccagccaccc tccctggctc 3360 
agaagtaggt tcagatgaac 3420 
tgtaggagcc ctgtatgaca 3480 
accctgccac agaggcaggg 3540 
agaacggaac acagcatagg 3600 
agggcaagga gtgcactgct 3660 
ctgtgacttc agaaggcagg 3720 
cagaacgggc agggtggcac 3780 
tcccaaccac ctccaaggcc 3840 
ccccccttcc caggtcaggg 3900 
cctttgggaa agcaacacat 3960 
tttgtctctt tctctctgga 4020 

4023 
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tttcttaaac ataatctgtg 
ataccaagta aatatgaaat 
gctgctggag atttgtactt 
ctgtgggagg agtaaagaaa 
tatgaaaatg tatttttata 
attaggatat aacaaaacta 
tatttctaaa aagatgaaaa 
aaaaa 

<210> 62 

<211> 2085 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<223> Incyte ID No: 27583 10CB1 

<400> 62 

cggagatgtg gcgaccgttt ctggcatcat tctgagactc ggcagttgct tctcactgct 60 
gcggccgggc ctgtctgtgg gagctgcatc ctcctcatct gcaggcgctg gaaaaccaga 120 
cacgatcgga catgcatgtg gttctgcggc caaagcacgc cctttggttg tgaacttcat 180 
gatacctgtg tgcagctctg tcatttccac tctgctctgc tgcacagaag gcagaagccc 240 
tggccgtccc ctgctgtgtt cttcaggaga aacgtcaggg gccttcctcc aaggttctcc 300 
agccccacac ccctgtggag gaaggtgctc tccaccgcgg tagtgggggc gcccctgctc 360 
ctcggagccc gctatgtcat ggcagaggca cgggagaaga ggaggatgcg gctcgtggtg 420 
gatggcatgg ggcgctttgg caggtctctg aaggtcggcc tgcagatctc cctggactac 480 
tggtggtgca ccaatgttgt ccttcgaggg tggaagagcc caggctactt ggaggtgatg 540 
tctgcgtgtc accagcgggc ggctgatgcc ctggtggcag gggccatcag caacgggggc 600 
ctctacgtga agctgggcca ggggctgtgc tccttcaacc acctgcttcc ccccgagtat 660 
acccggaccc tgcgcgtgct agaggacagg gccctcaagc ggggcttcca ggaggtggat 720 
gagttgttcc ttgaggactt ccaggccctc ccccacgagc tcttccagga gtttgactac 780 
cagccaattg ctgccgccag cctggcacag gtgcacagag ccaagctgca cgatggcacc 840 
agcgtggctg tgaaggtgca gtacatcgac ctgcgggacc gctttgatgg ggacatccac 900 
accctggagc tcctgctgcg gctcgttgag gtcatgcacc ccagctttgg cttcagctgg 960 
gtcctccagg acctgaaggg gaccctggcc caggagctgg acttcgagaa tgagggccgc 1020 
aacgcagagc gctgtgcgcg ggagctggcg cacttcccct acgtcgtggt gccccgcgtg 1080 
cactgggaca agtccagcaa gcgcgtgctc actgccgact tctgcgccgg ctgcaaggtc 1140 
aacgatgtgg aggccatcag gagccagggg ctggcagtgc atgacatagc agaaaagctc 1200 
atcaaggcct ttgctgagca gatattttac accggcttca tccactcgga cccacatcct 1260 
ggcaacgttc tggtgcggaa aggcccggac gggaaagcgg agctggtgct gctggaccac 1320 
gggctctacc agttcctgga ggagaaggac cgcgcagccc tctgccagct gtggcgggcc 1380 
atcatcctgc gggacgacgc cgccatgagg gcgcacgcag ccgcactggg ggtgcaagac 1440 
tacctcctgt tcgccgagat gctcatgcag cgccccgtgc gcctggggca gctgtggggc 1500 
tcgcacctac tgagccgcga agaggcggcc tacatggtgg acatggcccg cgagcgcttc 1560 
gaggccgtca tggcggtgct cagggagctg ccgcggccca tgctgctggt gctgcgcaac 1620 
atcaacaccg tgcgcgctat caacgtggcc ctcggcgccc ccgtggaccg ctacttcctt 1680 
atggctaaaa gggctgtccg gggctggagc cgcctggcgg gcgccacgta tcggggtgtc 1740 
tacggcacca gcctcctgcg ccacgccaag gtcgtctggg agatgctcaa gtttgaagtg 1800 
gcgctcaggc tggagacctt ggccatgcgg ctgaccgccc tcctggctcg tgctctggtc 1860 
cacctgagcc tcgtgccccc agcggaggag ctctaccagt acctggagac ctagggcgca 1920 
gccgcccagg gccggcgggg cccttttcac cttgggctga cggaggtggc ggggctagag 1980 
gtgtagacac cccgagcccc gtgggcactc gcactggggg gctgtgacag cagctgggcc 2040 
aggaggccgt gtaatgacca cacactcctc tcaagcaaaa aaaaa 2085 

<210> 63 

<211> 3014 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 2762348CB1 

<400> 63 

ggaggcaaag gccaggctga ggatcagggt ggcccgggtg gcagcgggga ggcgctgcat 60 

gctggaggct gtgctgagtg cccggtgcag gtgagccggt cctgcggagt tgtgccgagt 120 

gcctgctgca gaccgaggcn gggccaagat ggcgtctgtg tttcgaagcg aggagatgtg 180 



tttaatgatt attgtccatt 
aatcxacttt gcacataaca 
gagtataaat gtacaccagt 
atccaaaagc atttaatgtt 
ttattttact tatttggaat 
cttaaccatt tttgcaacca 
cttataaata aattcttaat 



gagcctgtac tctgctttcc 1980 
gaacaaacta taattacttg 2040 
ttttgtattt gtgaactcat 2100 
ttgtttttgt tctataaaga 2160 
ttacagagca cacctaagca 2220 
ttttgttttt taagcctttt 2280 
ttgtaattac ttttaaaaaa 2340 

2345 
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tttgtcacaa 

gctcggattg 

tgtgaatgaa 

gacgcaaaat 

ggaaatgatt 

ccagaaccag 

gaagaaaacc 

ggacacttct 

gttcatagcc 

gcgagtctgc 

tcctgtgacg 

gctcaggcag 

agagcctgcg 

aatcaccgtc 

caactggcac 

catgtgcaac 

agatgccaca 

ggcccccatc 

taaattcaca 

gataaaccca 

agactgtggt 

acgcttgctc 

tctgatccta 

ctccaagtcc 

cacatggaat 

aggagtgtat 

aaacaaactc 

ccagatggtt 

caacatcatt 

ggttttcatg 

cagcatcctc 

acccctctac 

tgtgccgtgg 

gctgcaggca 

cccttctagc 

tggagaagag 

ctgcctgggc 

tcatgcacaa 

aggctgggga 

agccatcctt 

ggttgagttc 

taaacacatc 

tcaccatgcc 

aatgggaagg 

cccgtggatt 

ttcatgtggg 

aacagatgtc 

ttcaaaaaaa 



ctgtttctcc 

gttcagttca 

gtcagaaggt 

gagattgtag 

accctggaga 

caggccttga 

caagacttct 

ggcctcctgg 

gggtgtgatc 

cgaggaaacg 

aaagaagaaa 

aaaatcaaga 

gtggagcgca 

acaacacaaa 

tcctggctca 

atcgacgtca 

cgtatcaaga 

atgaccacag 

gctggcttcc 

gccccctaca 

catggaaccg 

tcccagaaga 

cttatgggca 

ttgaacatct 

actcatgtaa 

tttggaaatc 

acatttctga 

tttggtgtca 

ctgcaattta 

atcattttca 

atccacttca 

aaacatcagc 

atgcttctga 

tccaggatcc 

cgttctggcc 

ttcaactttg 

tgcatttcaa 

ctgtctgaag 

ggaatcgtcg 

ctgatcatgg 

cagaacaagt 

ctggatggca 

aatgaaggaa 

ttgttcttgg 

tgatacgact 

gtattaattc 

ttcttgcaat 

aaaa 



aggtggaagc 

aagatttaaa 

gtgaatcact 

ttcagttgct 

ctgttctaga 

aacaaagctt 

ttgagacgga 

agttgaaagc 

caacagggaa 

tgtacttgaa 

ttcagaagca 

agatctgtga 

gagagatgtt 

cagagtctca 

tcaaggtgca 

cccagcagtg 

gggcactgga 

tgcaatctaa 

agaatattgt 

ccatcatcac 

tgatgctcct 

cagacaatga 

tcttctccat 

ttggctcttc 

tggaggaaag 

catacccgtt 

actcgtataa 

tcctcagcct 

tccctgagat 

aatggtgctg 

tcaacatgtt 

aagaagtcca 

ttaagccgtt 

aagaagatgc 

agaggacttc 

gagacgtctt 

acacagcctc 

tgctctggac 

gggtttttat 

agggcctctc 
tctatgtcgg 
cagccgagga 
gttcagtctt 
ctcacctgaa 
taaccacgtc 
tcaaataata 
attttaaata 



tgcatattgc 

tatgaatgtg 

ggagagaatc 

cgagaaaagc 

aaaactggaa 

cctagaactg 

aaccaattta 

agtgcctgca 

gaggatggct 

gttcagtgag 

catattcatc 

tgggtttcga 

ggagagcgtc 

ccgccagcgc 

gaagatgaaa 

tgtcatcgcc 

gcaaggcatg 

aacagcccct 

tgatgcctat 

tttccccttc 

ggctgcactt 

gatttggaac 

ctacacgggt 

ttggagtgtc 

tctatatctg 

tgggattgat 

aatgaagatg 

tttcaatcac 

gacttttatc 

ctttgacgtc 

tctgtttaac 

aagtttcttt 

tattcttaga 

cactgagaac 

tgcagatacc 

tgtccaccaa 

ctacctgcgg 

tatggtgatg 

tatttttgcc 

tgctttcctg 

ggatggttac 

gtaggctgag 

gtctttgata 

gcatgaaact 

agaggaagga 

aagtaattga 

ttgtatttga 



tgtgtggctg 

aacagctttc 

ctccgttttc 

ccaccgaccc 

ggagagttac 

acagaactga 

gctgatgatt 

tatatgaccg 

tcctttgagc 

atggacgccc 

atattttacc 

gccactgtct 

aatgtgaggc 

ctgctgcagg 

gctgtctacc 

gagatctggt 

gaactaagtg 

cccacattta 

ggtgtcggca 

ctgttcgctg 

tggatgattc 

accttcttcc 

ttgatctaca 

caacccatgt 

cagctggacc 

ccgatttgga 

tcggtgatcc 

atatacttca 

ctgtgtctgt 

cacgtatctc 

tacagtgact 

gtggttatgg 

gccagtcatc 

attgaaggtg 

cacggggctc 

gccatccaca 

ctctgggccc 

aacagcggcc 

gtatttgctg 

cacgccctgc 

aagttttctc 

ggctgcacct 

tcagcccctg 

gtgtattatt 

ctttggcaag 

caaatgaggg 

gaaaataaac 



agctcggaga 240 
aaaggaaatt 300 
tggaagacga 360 
cgctcccacg 420 
aggaagccaa 480 
aatacctcct 540 
tctttactga 600 
gaaagttggg 660 
ggttactgtg 720 
ctctggagga 780 
aaggagagca 840 
acccttgccc 900 
tggaagattt 960 
aagccgctgc 1020 
acatcctgaa 1080 
tcccggtggc 1140 
gctcctccat 1200 
acaggaccaa 1260 
gctaccggga 1320 
tgatgtttgg 1380 
tgaatgagag 1440 
acgggcgcta 1500 
atgactgctt 1560 
tcagaaacgg 1620 
cagccatacc 1680 
acttggcttc 1740 
tgggaattgt 1B00 
gaagaactct 1860 
ttggatacct 1920 
agcacgcccc 1980 
cttccaacgc 2040 
ctttgatttc 2100 
ggaaatccca 2160 
atagctccag 2220 
tggacgacca 2280 
ccatcgagta 2340 
tcagcctggc 2400 
ttcagacgcg 2460 
tcctgacagt 2520 
gactgcactg 2580 
cattctcctt 2640 
cccacggtgg 2700 
caaggcgctc 2760 
tggacgtcag 2820 
tgatattgtc 2880 
gagaatgcta 2940 
atctgagtca 3000 
3014 



<210> 64 

<211> 1726 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



3715961CB1 



<400> 64 

tgaaaatgcc 

cacatagagc 

tttctaatca 

gagtcctgaa 

atcagacagt 

caagaagccc 

gactaggcca 

agccaaagtt 

acatatgaac 



tgctgcgtac 
ctgcctggag 
atatgataaa 
agacaaggag 
gtgtgggaag 
acgtgttaga 
tgtcggaatg 
acctttttaa 
atctccgaga 



caaggtatgt 
aagctcatgg 
aaggtcagag 
atgttagtaa 
ggcagagtct 
agcaccggcc 
tcctcttatt 
tgatcttttt 
ggttgtgttt 



actagggcat 
tctgatggaa 
agcagtttct 
agcagagaag 
gaaaagagca 
ccatgagaca 
ttatacatac 
tacccagtgt 
ggcaaaagtg 



ctggggtaag 
agataagcaa 
gaaaaacatg 
ggagaattca 
tgccccattt 
aagacacagc 
ataagcacat 
attctggagg 
aagatgagca 



taaaaacaaa 60 
gaagagttaa 120 
tttttgagtt 180 
ttctagaaag 240 
ggagaagcat 300 
tagagagatt 360 
agatacatat 420 
tcgaatggtc 480 
ttatcccctt 540 
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tggaaatcag 
actgacctag 
ttgcgatttc 
gggctttggg 
ttaaccactt 
atcatgactc 
ccagccgatg 
cttttgtata 
agtctatata 
tggcttactt 
gatgcaaccc 
accccctatt 
gggcagcact 
aagtgatctg 
gaagaatcga 
ttctatgctg 
tgaagtagac 
ctgttatgtc 
gtacatactg 
cctagtgttg 



tcattggagg 
tgaaggttca 
gtggtgtaca 
caggctgggt 
atgatacagt 
acggtttatc 
tcatcaaaag 
aatcatcgac 
aaggcttttt 
atgaaaaaat 
ttaaagatac 
atttctacct 
ttattttttt 
gtcggatctc 
agcctgacca 
aagagcctgc 
gataggaatg 
agtatttatc 
taaattaaag 
aaggaaggtg 



gatgatggct 
gatgcaaatg 
tcacgcattt 
acccaatata 
gaaacactac 
aagcttatgt 
cagaataatg 
tgactgcttg 
accatcttgg 
cagagagatg 
agtgttcagt 
ctttaggaag 
ctggaaaccc 
acaaggccat 
ctttcacctt 
ttagaggagg 
tggaagaaca 
aatgaagttt 
ggaggtgaat 
tactctttct 



ggtgttattg 
gaaggaaaaa 
gcaaaaatcn 
caaagagcag 
ttggtattga 
tctggactgg 
aaccaaccac 
attcaggctg 
ctgagaatga 
agtggagtca 
attattgaaa 
acacctattc 
aagttctctt 
ccaatgagac 
gggcaagaag 
agtaccagga 
catacatagt 
gataattcac 
ggaaattaat 
tgtcagaaag 



gccagttttt 
ggaaactgga 
tagctgaagg 
cactggtgaa 
atacaccacc 
tagcttctat 
gagataaaca 
ttcaaggtga 
ccccttggtc 
gtccatttta 
tatgggcatc 
cacagagact 
tgactcctct 
cccgcacagc 
gtttggcctt 
gggagccagc 
gcctaagaaa 
ttttctgtca 
gaataaacat 
ataaaa 



agccaatcca 
aggaaaacca 
aggaatacga 
tatgggagat 
tgaggacaat 
tctgggaaca 
aggaagggga 
aggattcatg 
aatggtgttc 
aacccctaaa 
tgcaacacat 
gatttatagg 
ttttgtccaa 
attttctaaa 
tgagttgcta 
atttcagatc 
tacatttaac 
ttgttaaagc 
tttgagtttc 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1726 



<210> 65 

<211> 899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



5108194CB1 



<400> 65 

gcggcggcga 

tcagcgcccg 

ctggtatggg 

tcctgtcctc 

ctggggccct 

tcatcttcca 

acacctacct 

gcgtggtgcc 

gcagctacta 

cactggctgg 

tggccgtaac 

gagaagaggg 

cctacgctgg 

gcggccgcaa 

ttacaacgtc 



cgcgtccggg 
cgcctgtcgg 
taatggtgtg 
gtccgtctca 
ggctggtgcc 
agtgtcttca 
caacgaggga 
ctacgccgcc 
tggcttccgt 
aacgacagcc 
cccgaaggaa 
gctgaagact 
cctgagcttc 
gcttattccc 
gtgactggga 



ccggtgaggg 
gctgaactga 
aaggaaggcc 
tcaaagcgtg 
cttgccaaaa 
aaaagatttt 
tttctcagct 
atccagttca 
ggagaagccc 
gcttcactga 
atgtacagca 
ctctaccatg 
ttcacctatg 
tttagtgagg 
aaaccctggc 



ggcgcggggg 
ggaccgagtc 
cggtgcgatt 
accacaggca 
cagcggtagc 
ctgccaagga 
tgtggcgcgg 
gcgcacacga 
tgcccccttg 
cctaccccct 
acatctttca 
gatttatgcc 
agacgctcaa 
gttaatttta 
gttacccaac 



gcgccggggg 
tcctgccatt 
gcatgaggat 
agtgctcagc 
tcccctggac 
ggccttccgg 
gaactcggcc 
ggagtacaag 
gcctcgcctc 
ggacctggtc 
tgtcttcatc 
caccgtgctg 
gagcttgcac 
gcttggcact 
ttaatcgcct 



ggcccaagcg 60 
ccgagcaggc 120 
gctgaggctg 180 
tccctgctgt 240 
cgaaccaaaa 300 
gtcctctact 360 
accatggtgc 420 
cgcatcctgg 480 
ttcgccggcg 540 
agagcgcgga 600 
cgcatctcga 660 

ggggtcattc 720 

agagagtaca 780 
ggccgtcgtt 840 
tgcagcaca 899 



<210> 66 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CB1 



<400> 66 

ctttaagctg 

ggattatcac 

tggttcagcg 

ccgggaagca 

gtccaaacct 

ggatgtgcct 

agagaaatat 

cttcattcca 

acgttttcac 

tgggacattc 

tacactgtat 



tagctgtggt 
acctccatgt 
gattcctaca 
attcgacagg 
gtagcatttg 
gttccaagca 
aacaatgatt 
agtccactca 
ggagggaaat 
cgagcaactc 
tccttttaaa 



ttctgcagca 
atgacaattt 
caagcaggcc 
agagagaaca 
ccgtgaagac 
cagctatctc 
ggtggatagg 
gactggagaa 
caagtggaaa 
ccacatcaac 
aatatttgaa 



attttgtttt 
gtacctgcat 
gtctgactcc 
gcaagcagct 
aaatgtgagc 
ctttgatgct 
aaggctggtg 
catacggatc 
ttcttcttca 
aggtgagggt 
cacacatgca 



tgccttgaaa 
ggaattgaag 
gatgtctctt 
atccagcttg 
tactgcggcg 
aaagactttc 
aaagagggct 
cagcaagaac 
agtcttggag 
tgtagttaaa 
age 



gaggtgetet 
acteggagge 
tggaagagga 
agagagcaaa 
ccctggacga 
tacatattaa 
gtgaaattgg 
aaaaaagagg 
aaatggtatc 
ctctttttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

643 
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<210> 67 
<211> 2574 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<223> Incyte ID No: 5517972CB1 



<400> 67 

gcgcctccca 

gaaggggttg 

caggacggaa 

agcctgtgga 

ttaagtggaa 

ttacatcctt 

ttggttaaga 

aatgtcgaag 

tccaatgacc 

cgagtaaaac 

tcgaatatca 

ggcaaatctt 

gatgttctga 

gtacaagatg 

gctcttcggc 

attcaagagt 

ggtgtgtctg 

tccatcttgt 

cttttgctcc 

cctcgatatt 

atgttccacg 

gaggcctata 

gtggcattaa 

gataagccac 

acaaaagctg 

aaggagatca 

ttcaaaaact 

ctgggactgg 

aacagagctg 

gtggaactct 

agagtgtcat 

ccaagtatta 

gccttcttcg 

ctggccatag 

tttgtgttta 

ctgtcatggc 

gaatttttgg 

tatgcaacat 

ggcttgtgga 

tacctgaaat 

tatcctcaca 

ttctgttccc 

tagaaatcac 



cgccggccgc 
tgccgcgcga 
gacgcagtgc 
ggaactgggt 
actgctgctt 
aattgttaaa 
ccgagctcta 
tttttatccc 
tgaaggcatt 
tgaagagtgg 
atgggatcat 
cgttattaga 
taaatggagc 
atgttgtgat 
ttgcaacaac 
taggtctgga 
gaggagaaag 
tcttggatga 
tgaaaaggat 
ccatcttcaa 
ggcctgctca 
ataaccctgc 
acagagaaga 
tcatagaaaa 
aattacatca 
gctacaccac 
tgctgggtaa 
ttataggtgc 

gggttctctt 

ttgtggtaga 
cttatttcct 
tatttacctg 
ttatgatgtt 
cagcaggtca 
tgatgatttt 
ttcagtactt 
gacaaaactt 
gtactggcga 
agaatcacgt 
tgttatttct 
taaaaaagaa 
ttgccatcac 
aacaaactga 



cgccgacgtg 

cggtctgcgt 

tcgccactcc 

aggatttagg 

tagagtttgt 

gcgctgcctc 

ttaagctgaa 

agtgtcacaa 

tactgaagga 

ctttctacct 

gaaacctggt 

tgtcttagct 

accgcgacct 

gggcactctg 

tatgacgaat 

taaagtggca 

aaaaaggact 

gcctacaact 

gtctaagcag 

gttgtttgat 

ggaggccttg 

agacttcttc 

agactttaaa 

attagcggag 

actttccggg 

ctccttctgt 

tccccaggcc 

catttacttt 

cttcctgacg 

gaagaagctc 

tggaaaactg 

tatagtgtac 

tacccttatg 

gagtgtggtt 

ttcaggtctg 

cagcattcca 

ctgcccagga 

agaatatttg 

ggccttggct 

taaaaaatat 

gcactttgat 

actgttgcac 

attaaacatg 



atcgctcggg 
gctgtgccca 
cactgagatt 
aacgcaccgt 
ttggaaggtc 
cgagcgcacg 
aagataaaaa 
ggaaacacca 
gctgtgttaa 
tgtcgaaaac 
ctcaacgcca 
gcaaggaaag 
gccaatttca 
acggtgagag 
catgaaaaaa 
gactccaagg 
agtataggaa 
ggcttagact 
ggacgaacaa 
agcctcacct 
ggatactttg 
ttggacatca 
gccacagaga 
atttatgtca 
ggtgagaaga 
catcaactca 
tctatagctc 
gggctaaaaa 
accaaccagt 
ttcatacatg 
ttatctgatt 
ttcatgttag 
atggtggctt 
tctgtagcaa 
ttggtcaatc 
cgatatggat 
ctcaatgcaa 
gtaaagcagg 
tgtatgattg 
tcttaaattt 
tgaagtattc 
agcagcaatt 
aaagaaccca 



cgcgccgggc 
ctcaaaaggt 
gagagacgcg 
gcacatgctt 
cgggtgactc 
catcctgaga 
ctctccagat 
atggcttccc 
gttttcataa 
cagttgagaa 
tcctgggacc 
atccaagtgg 
aatgtaattc 
aaaacttaca 
acgaacggat 
ttggaactca 
tggagcttat 
caagcacagc 
tcatcttctc 
tattggcctc 
aatcagctgg 
ttaatggaga 
tcatagagcc 
actcctcctt 
agaagaagat 
gatgggtttc 
agatcattgt 
atgattctac 
gtttcagcag 
aatacatcag 
tattacccat 
gattgaagcc 
attcagccag 
cacttctcat 
tcacaaccat 
ttacggcttt 
caggaaacaa 
gcatcgatct 
ttattttcct 
ccccttaatt 
aatcaagttt 
gttttaaaga 
agaaaaaaaa 



gtggttgggg 

tccgggcgcg 

gcaaggaggc 

ggtggtcttg 

atcccaacat 

tcctgagcct 

gtcttccagt 

cgcgacagct 

catctgctat 

agaaatatta 

cacaggtgga 

attatctgga 

aggttacgtg 

gttctcagca 

taacagggtc 

gtttatccgt 

cactgaccct 

aaatgctgtc 

cattcatcag 

aggaagactt 

ttatcactgt 

ttccactgct 

ttccaagcag 

ctacaaagag 

cacagtcttc 

caagcgttca 

cacagtcgta 

tggaatccag 

tgtttcagcc 

cggatactac 

gaggatgtta 

aaaggcagat 

ttccatggca 

gaccatctgt 

tgcatcttgg 

gcagcataat 

tccttgtaac 

ctcaccctgg 

cacaattgcc 

cagtatgatt 

ttttgttgct 

gatacatttt 

aaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2574 



<210> 68 
<211> 1571 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



5593114CB1 



<400> 68 

??£??2!£? a cagcta S<?3t tcacggccac tggggcagag gagccgcgag aagatgtggg 60 

:""^ a ta cgggtccctg atctggaagg tggatttccc ctatcaggac aagctggtcg 120 

ccaatalnrZ f aactacag c aggcgcttct ggcagggcag cacggaccac cgcggggtcc 180 

ccggcaagcc tggaagagtc gtgactcttg ttgaagatcc tgcgggatgc gtatggggtg 240 
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a " gccagta SSaaaggaag aagaagtaaa agcatacctt gacttcagag 300 
cattcHtat arS 33 ^ acaacagtca "ctttatcc aaaagatccc IcaacaaLI 360 
e?)£™£ a "9Ctatat attggaacat gtgataaccc tgattatctt ggtcctgcac 420 
aata?ctt?t taa* 9 ^ 9 " caaatcttta acgcagccgg tccaagtgga Igaaatlcag 2 80 
a t^"" tgaacttgca aattctatta ggaacctcgc gccagaagaa gcagatgagc 540 
gc2atal?t tan?? 33 - 3 ttagtaaagg aacgtttaga agggaaalag LcctcLft 600 
ffr« a r aa ^ * agtc " cag agaattaact tcagtgcaca atgacaatat gatttggaaa 660 
aac?aaaalt oS^f" atttttaatg tagtgaggat attatttaaa cttttHttt 120 
a at«laata 1™!-^ a " tatttaa aatattggga tacagtgaaa gaaaaattca 780 
aa "^ a " a aca taaagat tccctaactt tatgttattg aacacttact cactagaagt 840 
Itcatcct« ?tca aa t aCa gtgaaggact cagttcagtc ttgtttttat cagag?ga?I loo 
a"ataaata?a a^rr-^ aatactattt tgaaattcta aacaattaaa ccaaaattcc 960 
ata aa ™ a aggtta ^ cc ttcaatatat tcctacacaa ttctgtaacc atggtttaaa 1020 
a taf a t aa ^ ^ aaaa 5 aa ° atgctta * aa atacacaata atatgaacag talltcagcc 1080 
aa a tt™«r a ""cttgt tattcaagta ttaaatgaaa tcttttgag? ttttagclaa 1140 
ttStalata aaoacatrr* Cgaaaatttc ^ttggaatta taatgtactg tacctcttct 1200 
tacaaatcat ££££"5 actatatgga aaataactca ctaaagcata aattacatta 1260 

gatcactaat gatgtagtct gtcattcact ttgtattaat cttataccaa 1320 
taaaacarct ftta^r? 3 h ta " acaaat taatggcaca tataatgaaa atttagtttt lllo 
£ aaaa f a » "tggaattc tttgtctgtc actatctcaa tttgtgtgtg tgtgtgtgta 1440 
caggctggaa aSglaalct EE""" tttctttgca ^clglglcl ggcla?clca £135 
tttgttttgt agt9taacct c tggcagaag ccaagaacag gcacctcctg gaattataat 1560 



1571 



<210> 69 

<211> 1549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 044775CB1 

<400> 69 



t«Icaac?a cgcggctgga gtgccgcggg gagggctgtg ccggttgctt 60 

ccgggafccl S S a ^tctcctc gccgtccccg gggcgctgtg cgtctccagt 120 
aZtnfn^ agcc S cct g c cgtagcgggc ggccagatcc gcgtcccgcc tcagcggccg 180 
lllltmt ggga 9 a gaga atgagccaga gggacacgct ggtgcatctg tttgclgga? 240 
ScSt£?2 ttct^ 3 g « attctga catgtccact ggaagttgtl aaalcalgal 366 
ttctgtgacg ctttatattt ctgaagttca gctgaacacc atggctggag 360 
« a »H™ «gagtagtg tctcccggac ctcttcattg cctaaaggtg atcttggaaa 420 
cclllllllt Itllttr^t tttagaggac taggccccaa tttagtgggg gtagcflctt ill 
atcc a oa aa c ta™^ gcttattcaa accgcaagga aaagttgaat gatgtatttg 540 
ttctcccaaa ™S ? catatgattt «gctgcaat ggcaggcatt tactgtacat 600 
tota«« a ! aaaagagt 9 a Satcgtgtca tctcatgctc cccatccgca ggtcacctcc 660 
Sacftataa ^f^^ taaacctcg * tttactgcaa tcacagcLc ILccccatt 720 
al?tttaaa a aca g<="gar gcaaggaacc gcggggaaag gcgaatgggt 780 

ItSctactt M ^gtatcag acagatggac taaaaggatt ttataggggc 840 

atfaaa™ ^ 95 tatatca 9 a 9 actgttatcc attttgttat ttatgaaagt 900 

gtgaaaoaaa c^^?? 3 atataagact gcttctacaa tggaaaatga tgaagagtct 960 
gclacalctf t™?!?" tgtgggaat 9 atgctagctg ctgccacctc aaaaacttgt 1020 
laatacaoat ™ 3 ^ CC acatgaagtt Staagaacaa gactacgtga agagggaaca 1080 
aaa tatcata ^ r gactctatct ttgcttgttc aagaagaagg ttatgggtct 1140 

gccacctlta latt™^^ 'catctagtg agacagattc caaacacagc cattatgatg 1200 
Iclaaaaaaa «™2? tBBt " acctactc aatggatagc agcacgagga ctgctgtact 1260 
Icaoaaoaaa taran*^** gactacagt 9 gaccatggga tacagaagcc agcatggcag 1320 
tta^aafaa 3 r*^" 99 . gaacatgtaa ctattctaag tggaagtttt gttgtaggaa 1380 
caoaacctcc «™ C "f at tacttg 9 cct ttcggtaatg tgaaaaaaaa aaaaaaacct 1440 
ccatgtcccc canf 333 ^ " ttagaagc actcccctct caaaattgcc attttctcta 1500 
ccatgtcccc cagacacagt tgggttttgt tgatttatgg cagtcttct 1549 



<210> 70 

<211> 2237 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: 116588CB1 
<400> 70 

?cccct?!S alt™?- ' cagcgcact tctgtcttaa gctcctgggc ctccttactt 60 
ac«aaaoat c«S™^« a 9<=cacacct gtggatgttg ctagttactc cgcgtccgga 120 
ctfc?? a Iac tllllllat? agctgggc 9 a gttttgtggc actcctttgc tcttcagcag 180 
gattaccalc ?tgctgccct tcagactcaa gttaaacgag atatgcaagt 240 

IttctttggI aoctttttoa ' gctgtatgc ctggtattct tggcccaatg tagttttglg 300 
tagctac??t a?ttaL^ ta 9accgagt acctggaata cgatggggca caatcatttt 360 
ttggc?aata aaat?tl, 9 H*Sf? Bttgt ""^cctg ggtggaatat ttaatgcttt 420 
cclflafacl ? a to^r aa ga " fc 9 tatt tgggattggc ggcgagtccc tagcagttgc 480 
tcaUttaac ataorS g « ggtttaa aggcaaagaa ctaaacctgg tgtttggact 540 
ttctaaSalt tllltltr*? " ggaagcac agtaaacatg aacctcatgg gatggctgta 600 
^f 9 ^ ? C Caggtt «gc tggtcacaca accctcggga tcacacttat 660 
Kaglllaca a L? " "ctttcact aatctgtgcc ttggctctEg cctacttgga 720 
aaclaSata Itn^t t Ct ? ataaa 9 a acaaggaaaa acaggtgaag ttattaaatt 780 
ftaSf aag ? aCttC 5 cct tacccct gtggcctata tttatcatct gtgtctgcta 840 
atttSr?^ ^ 9ttccctt ttattggact tgggaaagtt ttctttacag IgLat?tgg 900 
Sccccggta ttfaaact^ gtg = aattaa cagtgttgta tatgtcatal cagctcccf? 960 
cgclgtlgcl gccl??ctta * g ? tggataa aa cagggaag aacatcatct gggttctttg 1020 
glttgctltg ?at a ttctal «?^~ 3C3t gatgctggcc tttacgatgt ggaacccttg 1080 
ggtaacatt? SL« l^?? gactctccta ctcattgctt gcctgtgcat tgtggccaat 1140 
Icaolatctt a™?^^ 9 aacatcagct gggaactgca tatggcttca tgcagtccat 1200 
gta?ttattt ? tcatttccat cattgctggt atgatactgg attctcgggg 1260 

Ittactctat ttao aa I=^ ^"cattgc ccgtgtttct ttgtcacttt t atctg?Ht 1320 
ggataaaata aalfr?^ g ' gcccagg 9 tgggaaccta aatcattctg caagacaaag 1380 
fgctlaacac at^tr™^ atactgaatg agaagttaaa atgaatgtgt catgagaatl 1440 
olaaaaataf r™!^ 99 " tgaaaacttc catttttaaa aatttagagt ttagtcatta 1500 
? a ata a aa aa rf? 3 ^" 33 agttatat " atatccaaat atacctattt caaagtgtat 1560 
tagtlglcta afcaacafrS tgtg6ctttt Stattgtgtg ttgctaaaga attclacttt 1620 
calaalloac aaagggttag aaaattgctg tggaacatcc aggtgaactt 1680 

ta?l a a a ftt aa™ 33 * 3 ggaaaa cgtt ggagcttctg ttgagataat cttcattagg 1740 
Jgatataaaa aaattl^t CCttttcttt atcttatagc aggaaaaaaa aacttttgal 1800 
II a a a c a Q a a clllltZZ^ acacaaa « a aacaatggca ttgtcatagg ccttccttt? 1860 
cacalacll? lalttn^ gggaatat 9t gaagatgttt ttatgaagtc tctttctgat 1920 
tclltttctt afttc?^ " a " ctgta gttatgtgga ttgccgagca atgacccltt 1980 
ataaactttc totl™^^ tactgaggac cctaatcact tagggatgta attttatagt 2040 
gataattacc £££££ ""tatagt ctaataagta aaaagtgtcc ttcaaattat 2100 
laaaaaalaa a S« S g^aaattaa aacactgcac acggagtaaa aaaaaaaaaa 2160 
ct aa g aaa tg g cc aaa | gag Cggccgcaag cttattccct ttagtgaggg ctaattttag 2220 

<210> 71 

«=211> 1114 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 875369CB1 

<400> 71 

rn~ aa ^ 9 = a gcagcccc 3S gggccccctt gaggcggcga ggccgcgaag ggcgcggggc 60 
tllclllltl ???aaf^ gaaagcgc « cagacgcclc ?cgcgg?ggl cgglggcclg 120 
cglgaaolco e^™^ 9 gcccgcggc 9 ccatggctca cgtcggctcc cgcaagcgct 180 
a1af?alctc atlZ*t?Z fggggacggg ggtcggaaaa gagaaagaag aagagcagga 240 
^In™™ ^aggaactgc tcggcctcca catcccaagg tcgcaaggcc agcacggccc 300 
™fo 9 I^£! Sgcctcacct tctccctgca tcacagagag aagcaailag alggcllgga 360 
IltcgtcctI ctceS^ * CCtcctc " cttccagctl ttctagltcc tc^cttlct 420 
waaaaaaaa aa™™ tccagtgatg gccggaagaa gcgggggaag tacaaggaca 480 
ala?acla aa S 39 aagaggaa 9 a agctgaagaa gaagggcaag gagaaggcgg 540 
gqaaaoflaa Si" ct 9ccgggcc cctcgctgga ccagtggcac cgatcagctg 600 
IglcIItglc clllalal^ gtcctgacgg atgagcagaa gtcccgaatc caggccatgl 660 
c?alaacaac llnll " 9 tgggatgccc ggcagagcat catccgcaag gtggtggacc 720 
cclaloalfo ™ 3 ^ a " c " attaagg gagatggcga ggtcctagag gaaatcgtaa 780 
fo aaa ^ 3a ^ acacaga 9ag atcaacaagc aagccacccg aggggactgc ctggccttcc 840 
lllllclllt llllllT^ ccctgagggc ccccgctggc clllgcctgt gglcgacgct 900 
ggcggcccag cctgggcagg tttcagggtg ccagtgggaa gcctgatggg tgctggtggc 960 
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ctttcccccg tggattggtc tctggcccag cccagtctct tctcaggggc agggggtgga 1020 
ggttggggtc accggcctgc ttggcacccc catctgaaag agcagcactt ctcagctatt 1080 
aaaggccccc tggatagaca aaaaaaaaaa aaaa 1114 



<210> 72 
<211> 998 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1325518CB1 



<400> 72 

!^o^ aaa gccat S tatt ctttcgtttc tctctaaaag aagaaaaata taatttaaaa 60 
a «^™? tattttctaa aacaataaat ttatagtgtt aatattcata gggtcaatca 120 
t aa ^ aa ?" tctcctttgg gcctgcattg tatgtgttgc ttttgcaagg aagagacggt 180 
^ tggtgaggat gacaatgacg atggtcaccc acttcatcca tctctgaata 240 

111 lilt 9g f atac ^ aat ttaccacctc ctctttatta tcgcccagtg aatacagtcc 300 
ZZZIZ taccc tgggaatact tacactgaca cagggttacc ttcgtatccc tggattctaa 360 
llllllll^ at 5 cccctat gtctatcaca tccgtggttt tcccttagct actcagttga 420 
atgttcctcc tctccctcct aggggtttcc cgtttgtccc tccttcaagg tttttttcag 480 
l*«ll%lt*l accc 9 ct 9cc ccacctattg cagctgagcc tgctgcagct gcacctctta 540 
cagccacacc tgtagcagct gagcctgctg caggggcccc tgttgcagct gagcctgctg 600 
^S^ a ^ C 5 g " 9gagCt S a 9cctgctg cagaggcacc tgttgcagct gagcctgctg 660 
r2E??-£™ tgttggagtg gagccagctg cagaggaacc ttcaccagct gagcctgcta 720 
cagccaagcc tgctgcccca gaacctcacc cttctccctc tcttgaacag gcaaatcagt 780 
r aaa i^^5 a ? aagagtac cat 9ggttca tttctatact gatgcagaaa taagtgaaat 840 
ctacaaaagt tttctttctt ttccaaagac tatttcattc tgttgtattc agagtattca 900 
«^ aC 5 aCa "gatttgtt tgtggtagtt tttccttgga cttaatttat attgaaaaaa 960 
cattgataat taaataaata aaatagataa tttagaca 998 

<210> 73 

<211> 2348 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2060987CB1 

<400> 73 

H?^f agC ° g 9 ctc g ccc gg gggccatggc agcagcggct actgcagccg agggggtccc 60 
^ a ;£™ 99g cctccc 5ggg aagtcatcca tctgaatgtg ggaggcaaga gattcagtac 120 
lllll 9 actctcacct ggatcccaga ctccttcttc tccagtcttc tgagcggacg 180 
^""^ ga ^ g ct 5aaagatg agaccggagc aatcttcatc gacagggacc ctacagtctt 240 
^™^ Ca 5 C ctcaacttcc tgcgcaccaa agagttggat cccaggggtg tccacggttc 300 
catgaagccc agttctatgg gctcactcct ctggttcgtc gcctgcagct 3 60 
tcgagaggag ttggatcgat cttcttgtgg aaacgtcctc ttcaatggtt acctgccgcc 420 
llltilil cca 5 t gaagc ggcggaaccg gcacagccta gtggggcctc agcagctagg 480 
aggacggcca gcccctgtcc gacggagcaa cacgatgccc cccaaccttg gcaatgcagg 540 
gctgctgggc cgaatgctgg atgagaaaac ccctccctca ccctcaggac aacctgagga 600 
gccggggatg gtgcgcctgg tgtgtggaca ccataattgg atcgctgtgg cctataccca 660 
nlllll-~ 5 gCtacaggt t 9 aa ggaagc ctctggctgg cagctggtgt tttccagccc 720 
llzZlzl 9 ^ t ^ cccatc g aacgactggc gctcacagcc cgggtgcatg gtggggcttt 780 
gggtgaacat gacaagatgg tggcagcagc caccggcagc gagatcctgc tatgggctct 840 
^ a ^?? aa ggcg ^ t ^ ct ccgagatagg ggtctttcat ctgggggtgc ctgtggaggc 900 
cttgttcttc gtcgggaacc agctcattgc tacaagccac acagggcgca tcggggtgtg 960 
^ a oS CC ? tC a f caa ^cact ggcaggtcca ggaggtgcag cccatcacca gttatgacgc 1020 
ggcaggctcc ttcctcctcc tgggctgcaa caacggctcc atttactacg tggatgtgca 1080 
ltt~l tcc " tfc gcgcatga aagacaacga cctccttgtc agcgagctct atcgggaccc 1140 
agcggaggat ggggtcaccg ccctcagtgt ctacctcacc cccaagacca gtgacagtgg 1200 
^ aa ««f gat ^ gagatc ^cct atggcaccag ctcagggggc gtgcgggtca tcgtgcagca 1260 
cccggagact gtgggctcgg ggcctcagct cttccagacc ttcactgtgc accgcagccc 1320 
^^«S aag atcat ^ctgt cggagaagca cctcatctca gtctgtgccg acaacaacca 1380 
Irlflt t tggtctgtga ctcgcttccg cggcatgatt tccacccagc ccggctccac 1440 
^ a "^?" tcctttaaga tcctggctct ggagtcggca gatgggcatg gcggctgcag 1500 
tgcuggcaat gacattggcc cctacggtga gcgggacgac cagcaagtgt tcatccagaa 1560 
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^*£9ccc agtgccagcc agctcttcgt gcgtctctca tctactgggc agcgggtgtg 1620 
tlllalim ^f^ 9 ^ctcacccac gacggccttc acagcgcfgg agtglgagg? 1680 
™ ctcggctctc ggccccggcg ctacctgctc aceggccagg ccaacggcll 1740 
tgggacctaa ccaccgccat ggacggcctc ggccaggccc ctgcaggtgg 1800 

ttclgc?^ tca^acct ^I aaC39Ct ggaacactgt gagctggccc cgccggltcf i860 
ccac?caacc gt = tccc <=ag cccctcaccc cgcatctccc tcaccagcct 1920 

ggctgaalcc SS* ^"ccttgtc tggccaccgt gggagcccaa gccccccgca 1980 
flf™ 9 ^ cgflegccgtg gtgggggcag ctttgtggaa cgctgccagg aactggtgcg 2040 
clctlccctc Icacctclca ? gc « cccac accagccccg tggccctccl gcgglltcgg 21o6 
accataatac ^fS™ agatgaagct caatgaaact tccttttgaa caacgcagct 2160 
golaccccla cctggtcctg ggggactcag gtgcctccct gattcctgtg 2220 

clccttc«? cttttlf^f agggcctcc t tggaataaat ggttattgtt actaggtccc 2280 
aaaaaala cttttct 99 a agccaaagtc accctcccca ataaagtcct cactgccaaa 2340 



2348 



<210> 74 
<211> 1139 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2172064CB1 



<400> 74 

ccaccccca? a a a!^^?° aggttcctgc ggcggctgct ggcggaggag agccggcgct 60 
etS™ E 99 ^ gCCtC Wttcccg tgctcctggg attccgcctt gtgctgctgg 120 
agccaaac?a llllll? 9 ™ " tggtgatg agcagagtga attcgtgtgt cacacccagl 180 
a9 ^ 9 ??" g caa ggctgcc tgcttcgatg ccttccaccc cctctccccg ctgcgttcct 240 
a?£cat*at ?S f " 9 gtgg «<*ac ccagcgccct ctatatgggl ttcactctgt 200 
ItZtlla^l ™?,« a " 3 aattatca g gaaaggggaa ggaggaggag accctgatcc 360 
alrtaaclca a^ a ^ aCa gat 5tcccag gggctggaag cctcaggctg ctctgggctt 420 
aclS?Itoa St^SST Cggcttgtcc ^gagggggc agccctgggg ttgcagEacc 480 
gtatlacc?a lllr^t 9 cccagctcet ttgcatgtcg ccgagaacct tgccttggta 540 
aaatc 3 ™ ^ c S«cctctg agaagaccat tttcctaaag accatgtttg 600 

oa?a» a ™ " tc ^tctc ttgtttactt ttttggagct tgtgcttctg ggtttgggga 660 
gcaccaafaa I™? 9 " 9 cacaaatctt cctcttctaa atacttcctl Ilttclgaga 720 
Itcaaalaa? S M g<=aaccgata gcctcccagt ggtggaaacc aaagagcaat 780 
aaa aa o aa3 S ag " ccagga agaagcttag cccaggaaaa acaaagacca gttggaccca 840 
I« a cIaaoI aaa-?^ 9 ^ gaactttggc caactttcct catcaccata cttlaaatcc 900 
aacaactaS fSSf"!" caccatttt <= tatacatgtg acatatgtag eagcataacc 960 
ctaataaHa J 9C <="gactcca cctctacata caatgactca gctaaccaga 1020 

aSgtcctc? t"? , ^ accattgctc agggaggcat tgctttgggg aattattccc 1080 
agcgtcctcc ttacttatcg caagtaataa aaatcccctg ggaaatcctc aaaaaaaaa 1139 



<210> 75 

<211> 863 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 22192 67CB1 

<400> 75 



tcc?aoaa a ? "J; CC ? CCgC agc «ggcac ctcaggtgcc gaccttctac ctgccctact 60 
Ir^-t" c 93cctggag gcggcagcct tggtcttcct gctcctgacg gcagattgct 120 
fccOactaa aa^^^f? « gctgggca ccatggtcac aggcctggcf LdtgcSc ISO 
""^"jg ggcccagtat ctgccaggct ggactgtgct gttcctctct gtcctggggc 240 
caatoltcaa o^ 9 " 9 ' 9 tccgca «<=a gcagcctctc cgcggccgag gtcttcclca 300 
ccSccccI? aoa^^?! ctgggcct 9g tgctgggggc cgggttcctg ggccaggcag 360 
ccfceeS ?£™ et0 cac =9 gcc ggc agggcttctt cctgcaacaa gtcgtcttcg 420 
Srctgcccla «cactSL C n "gctgtgtg tcctgctgct gcctgagagc cgaagccggg 480 
9 ?" 9 "^ a g tc actgcag gacgccgacc gcctgcgccg ctccccactc ctgcggggcc 540 
™™ ggaccacctg cctctgctgc cgccctccaa ctcctactgg gccggclaca 600 
ggcctfggac SewcSS^ tgCCtggtgg ^ctgggagc caggatgggl ccallgtcaa 660 
OSS ^™ 9 9 acccca gacg tctggtccag ggcagacaca ttcctctcag 720 
aagcccgtgt ctcagtgcag gtggagccgt ggggacagcg tgaaggtgtc tccagccagg 780 
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aaggc??Itg tgglaal^ J^tctccc cccagccaca cccagtaggt gtggaggata 840 

863 

<210> 76 
<211> 1322 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2308629CB1 

<400> 76 
ccggggggcc 
actccggccc 
caggcgccgt 
cccggatgca 
tctggaggat 
caacaggcgc 
cattgagtga 
gtgtggcaac 
tgcagatgta 
atgaaggggc 
tccaagccat 
ggtacaaccc 
ccacaacccc 
actcacacat 
taggtggggt 
ccacagccat 
aagagtggag 
gctgcatcct 
ggaaggtaga 
acctctgttg 
gcacctttga 
ggataagcag 
aa 



ggcgcgcggg 

ggactacgag 

ggcagggatc 

gagtctacag 

tataagaacg 

agggcctgcc 

tgtaatccac 

attacttcat 

caactcacca 

cggggccttt 

tcacttcatg 

aagctcccac 

actggacgtt 

tacaggacat 

gaccgcctat 

cgcatggtct 

ggctggcaag 

ggtcacattc 

ggggctctcc 

ccaccacctt 

taacctctct 

ctcattcccc 



gaggccgggg 
gcgctgccgg 
ctggagcact 
cctgacccag 

gagggcctat 

cacgcccttt 
cctgggggca 
gatgcagcca 
taccaccggg 
taccgcagct 
acctatgaat 
gtcctctctg 
tgcaaaacac 
atcacaggca 
ttccgagggg 
gtgtatgagt 
tgaagtagca 
tctgtctcct 
cccaggattt 
tccttccagg 
ccatcctggg 
tggttcctaa 



cctgcaggcc 
ctggagccac 
gcgtgatgta 
ctgcccgcta 
ggaggcccat 
attttgcctg 
atagccatat 
tgaaccctgc 
tgacagactg 
acaccaccca 
tcctgcagga 
gagcttgcgc 
tgctcaacac 
tggctagtgc 
tgcaggccag 
tcttcaaata 
ctgaacgaag 
ggaatgctcc 
tggtgttttg 
ccctaagcac 
cctgatgacc 
taaaaagcct 



cccggtacga 

tgtcaccacg 

ccccatcgac 

tcgcaatgtg 

gagggggctg 

ctacgaaaag 

tgccaatggt 

ggaagtggtc 

tgtacgggca 

gctgaccatg 

gcactttaac 

aggagctgta 

ccaggagtcc 

cttcaggacg 

agtaatttac 

cctaatcact 

ccaggggttc 

cacctcaagt 

actaacacca 

gtgcagcaaa 

tgctctagac 

ttaaattaaa 



caagatccgg 

cacatggtgg 

tgcgtcaaga 

ttggaggccc 

aacgtcacag 

ttaaaaaaga 

gcggccgggt 

aagcagagga 

gtgtggcaaa 

aacgttcctt 

ccccagagac 

gctgccgcag 

ttggctttga 

gtatatcaag 

cagatcccct 

aaaaggcaag 

agatgacact 

ggagttagaa 

gttcctgcca 

gcacaccaca 

tgttatagag 

aaaaaaaaaa 



<210> 77 
<211> 1869 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2660038CB1 

<400> 77 
cagggacgct 
ggctcgagga 
cttcttcctg 
gctcccgtcg 
ttgacctcag 
ccctgcttgc 
ccctgacacc 
aggccttccc 
gggtcagttc 
gcggggcctg 
gagaactgca 
tcaccatcgg 
tgtaccagtt 
gctacctgca 
tggctggggt 
aggcacaggc 
aggcgctaac 
gcggcctgcc 
aggacctcct 
ctgccatgat 
caaggctcta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1322 



cattcctggc 
aatcaccggc 
cctccctggc 
cccgccatcc 
agggtcctgc 
cagctcttga 
cttgactgca 
cctgtcttcc 
ctgctacagc 
tgtattcacc 
ggcccctggc 
caaggtggat 
cctgatgaat 
cacagccgaa 
catgggagcc 
agcctcagaa 
cgagattggc 
ccgagttatc 
gagccagtgg 
gagtggcatt 
caaccagccc 



cagagtcctg 
tcttctcccg 
ttccatctcc 
tgagccatcc 
cttaagcttc 
ctcccaaatt 
aatctaaatt 
aaccccgctt 
tatcccacca 
aatcccctgg 
acataccagc 
ggccttgctg 
ggcatccgac 
gcgacccaca 
tacttgggga 
attgctgtag 
cagaaacatg 
gtcggttcct 
gagatctttc 
gcagttgtct 
acagatgcac 



acttctttct 

gatccttttc 

caaccccgcg 

cacctgcaac 

tcaccagaat 

ccagctgacg 

cttatcttct 

tctgacaccc 

tggacttctt 

aggtggtgaa 

ggcactaccg 

ccctgcagaa 

tgggcaccta 

gtcctgcccg 

gccccatcta 

ggcaccagta 

gtctggtggg 

ccacccagct 

ctccccagag 

tggccatggc 

agggcaaggg 



cggatcgaga 

ttctctcact 

attccctcct 

cttctgtctt 

ctcctagatt 

tttgaccact 

gcaacctgta 

atttttactt 

gatgagtggc 

gaccaggatg 

aaatgtcttc 

aggcctggcc 

tgggctggct 

cagcgcagca 

catggtgaag 

taagcatcag 

gttatggcgt 

gtgcaccttc 

ctggaagttg 

accctttgat 

cctcatgtac 



tctcgttgct 

tgctggcttt 

ctactcccgt 

ttgcccctcc 

tctatctctt 

tgacatttga 

ctgctgaaac 

tcttactctt 

ctggcagcct 

cagttgcaag 

catgccttca 

cccgccctct 

gaggctgggg 

gctggggcca 

acacacctgc 

ggcatgtttc 

ggggctctgg 

tcatccacca 

gcgctggtgg 

gtggcctgca 

cgggggatac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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tggacgctct 
gtgcctccta 
tgcgctccct 
agcactcctt 
ctttcatcca 
actacaagag 
tccctgggta 
tcccactcac 
ccctccaaaa 
ctgttagatt 
aaaaaaaaa 



gctgcagaca 
cttccgcctc 
ctactacaca 
ggccacttgt 
tccatggggg 
atgagtttcc 
actcacaccc 
cactgccacc 
ccaaagggaa 
caaaggcaca 



gctcggaccg 
ggcccccaca 
gacactaaat 
gcctccacca 
acagccaacc 
cttctttcct 
ctctctcagg 
cttgagtccc 
tcgagagacc 
gagattatat 



agggcatttt 
ccatcctctc 
aacagccgct 
ctatgtcctg 
ccactcccca 
tgggtgttgc 
gctgaacgag 
tcctgctccc 
caattcccag 
tgattataaa 



tggcatgtac 
cctcttcttc 
ttcccagtct 
gtgactactg 
tctgttctca 
tttaaacctt 
tcatcccaaa 
atgcacagtt 
gcgtctggga 
gcaagtttat 



aagggtatag 
tgggaccagc 
ccaccaaatg 
attaggtgac 
gggttgaatc 
ccctacccat 
gtgtatttcc 
ttaaactcct 
cccaggtgtc 
tctgaaaaaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1869 



<210> 78 

<211> 1881 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2670745CB1 



<400> 78 

gaagaaccga 

cgggttctag 

atagcagagg 

tgttacggtg 

ctgtcgggca 

tgcgtcagat 

cggtgaagtc 

caaccgtcct 

cctgcaatgt 

cagggacagt 

acatttttgg 

ataccagcac 

acttccactc 

gctttgggcc 

ccagaactga 

gccactcggc 

tgaaccggca 

ttgaaccgaa 

acaagattgt 

ttgaccttat 

tgtgcaaact 

srgtgggagct 

atgggtagga 

ctgcccatct 

gttgttgggc 

gggctaggtt 

cacctccttt 

gaagggaagg 

cctccccctt 

aaggagttgc 

gcagataaat 

ttttataaaa 



gcttggctgt 

gctgcaggca 

cagcaggccg 

gacagtgcac 

tcgggtatac 

agatgtgcac 

tgccatccgt 

catcgacgac 

gctctatgcc 

tcctggggcc 

gggctacgag 

catgacatgg 

agccacaatg 

attccattcc 

ggcttggctg 

ctttggctac 

cttccatgac 

ggggaagggg 

cctctttggg 

agatcattct 

ggccgtgatt 

gaatgccatg 

ggaagtttct 

gtcacccacc 

gagaggtgtt 

cctcccccct 

cagctgctcc 

gaatggggag 

tgcctatccc 

agctgttggc 

cccacccttc 

attaattaaa 



gtttatctcg 

gctcgaggac 

tgccgggggg 

ctggagggcg 

tccttcgggg 

attttcaatg 

gggcaagctc 

acagtcctcc 

tttgacgtca 

cgggatggac 

cagcaggcgg 

actcttatct 

ctgggaagtc 

aacaatgaga 

gactgtcccc 

aatggggagc 

ctctggaagt 

ccatgtcccc 

ggtaccagtc 

gacttacaca 

cagtataacc 

accaccaaca 

gccacctccc 

tgctcctttg 

ctctgtgctg 

tgggccgagg 

tgggcctcag 

aagggagaag 

ctcccctctg 

atgagacctc 

cttgagctgt 



ttggggacta 

ccgcggcccc 

gcatgttgct 

ggccccgcag 

gttactgctc 

cagtgtcctt 

ctgtggtacc 

tttggggcgg 

atacgcacaa 

attcagcctg 

actgtttttc 

gtacaaaggg 

acatgtatgt 

tttactgcaa 

cgactccagt 

tgtacatctt 

ttaatcctgt 

gccggcgcca 

catctcctga 

ttttggactt 

tagaccagtc 

gcaatatcag 

ctcctgagcc 

acccctggac 

tgaattcagt 

gccccttccc 

ctctgcccag 

caagcagtgt 

cttgagcctt 

cttctccccg 

cgctgtactc 



aggcgtcggt 

gccccggctc 

gtaaccagtg 

ggtgaaccat 

tggtgaagac 

gcgttggaca 

ctacatgcgc 

gcggaatgac 

gtggttcaca 

tgtcctaggc 

caatgacatt 

cagccctgca 

ctttgggggc 

ccgcattcga 

gctgcctgag 

tggtggttat 

gtcctttacc 

gtgctgctgt 

ggaaggcctg 

tagccctagt 

ctgtttgcct 

tcgccccatc 

tgctgtcatc 

ttggtatacc 

ggggagctgt 

cttggtgctc 

ggccagccag 

ctgagcctca 

gagccttgac 

tcttggggag 

tgaagttcag 



tggcgcgcaa 

ggcctggcag 

gcccagggga 

gctgcagtgg 

tatgagacac 

aagctgcccc 

tatggacact 

accgaagggg 

ccccgagtgt 

aagatcatgt 

cacaagctag 

cgctggaggg 

cgtgccgacc 

gtctttgaca 

gggcgccgga 

aatgcaaggc 

tggaaaaaga 

attgttggtg 

ggagatgaat 

ctgaagactc 

catgatatca 

gtctcctccc 

ttcactgccc 

tccatgtgga 

agcggggtgg 

tgtccccatc 

gttctgctgg 

ggagcttccc 

tgggagctga 

gtggggacca 

ccagctcaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1881 



<210> 79 

<211> 2004 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



2676443CB1 



<400> 79 

cggStglgg J££2££ S^" 9 <=ccccggggc gcgcgcgcgc attggctgtg 60 

tgcggccgf? !c!co™« ^ctttga accgggcgcg gggcgcgggg cgcggggcgc 120 

gcggccggt acacgccggg gtagggccgg ggtcgggttg tggtcgggcc gggattgggc 180 
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tctcctgggc 

cctgcggggc 

agcaggaggt 

tcctggcact 

ctcggttcgc 

tccccttcca 

ggacgcccgc 

agcggcgcaa 

tcaggagact 

gccagctgtc 

acaacatctt 

gccagcccct 

ttttccactg 

tcccagctgg 

tacgcaaaaa 

caggccagac 

acgacctccc 

gctggctgag 

ggctaccagg 

ctcaggggaa 

tattatccct 

ggacttgggt 

cacttcgtgt 

gcgagacccc 

agggagtggc 

cccgcgcacc 

actcaccagc 

cccccccgct 

ttctccccga 

ggcttcaccc 

ggagctgcca 



catggcagcc 
cgcctgctgc 
gaagctgcgc 
ctacagcggc 
catctacgag 
cgagaaggtg 
agacttggtc 
ctacgcccat 
gttctcgggt 
ctgctacgac 
cactcacttt 
ggatgtgctg 
cgccgtggag 
catccgcctc 
ctttggcatc 
acgctaggtt 
tggccgtggc 
ctcctcctgg 
ctctcccggc 
caggggctac 
gcctcctgcc 
ggcagagtgt 
ctgctgagag 
gcagctgggt 
ccagggtggt 
gctgcctgcc 
aggcggtctc 
tggctccctg 
caaggagtcc 
cacctgggaa 
aaaaaaaaaa 



gaggcgcgcg 
acgcacccgc 
atgacgggca 
ctgagcgcct 
actgtgcggg 
ttgctgggct 
aacgtcagga 
gcgctggatg 
gcaaccatgg 
caggccaagc 
gtcgccagct 
aagactcgcc 
acagcgaagc 
atcccccaca 
aaagtgccat 
cttccaaaga 
cacccatcct 
cctcgtcccc 
tgggcactgc 
cagaggctga 
cccgatgccc 
gggtgcagcc 
caaccagacc 
gggatgaaca 
tctggagcca 
ctgcagtggc 
tgtcttcagg 

ggggaatggc 

gacggggcgg 
gggcagtgtg 
aaaa 



tgtcgcgctg 
tggacctgcc 
cggcgctgcg 
cgctgtgcag 
accgcgtggc 
ccgtcagcgg 
tgcagaacga 
gcctgtaccg 
catccagccg 
agctggtcct 
ttattgcagg 
tgatgaactc 
tcgggcctct 
ccgtgctcac 
cctgaccagc 
gtcccaagcc 
ccgcagcagg 
tctcagctgt 
gtggccttgc 
tttctcccct 
aaagcagcat 
tggctgttgc 
ttccatgtcc 
agcaacgcag 
ttgtgggtga 
tttaacagct 
gattgtgcct 
ccaggcgggc 
atgctgcatc 
ctctgtgggg 



gtacttcggg 
caaggtgcat 
ggtggcgcgt 
acagacgacc 
caagggcagc 
tttagctgga 
cgtgaagctg 
cgtagctcgt 
aggggcctta 
tagcaccggg 
tggatgtgcc 
caagggggag 
ggccttttac 
ttttgtgttt 
cgtgggaatg 
cagcacctgc 
cccctgctgt 
agctgcacca 
ccctctcccg 
ctcctgggcc 
cttccagcac 
tcacccaagt 
tcgggcagct 
accacaagcg 
gggtcgaggg 
agttttgcca 
gcgtccctcg 
cgcggttcct 
ctctgcctcc 
gctgcaatca 



gggctggcct 
ccgcagacgc 
accgacggca 
tactccctga 
caggggcctc 
ggcctcgtgg 
ccccagggtc 
gaagagggtc 
gtcactgtgg 
tacctctctg 
acgttcctgt 
tatcagggcg 
aagggcctcg 
ctggaacagc 
gctgggctgc 
tcctggggcc 
ccccccacct 
cccccgctct 
ctggcagctc 
aggggagggg 
tttccatcga 
gctagctctg 
gcaactcccc 
agtgcctggg 
ccaccgaggt 
aagcctctcc 
ggcacctggg 
ccttagggcc 
ctggtcgctg 
ataaacgccg 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2004 



<210> 80 

<211> 3555 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<223> Incyte ID No: 



3295764CB1 



<400> 80 

gaccgtgggc 

aagtgcctga 

cctcccctcg 

gcggccactg 

gcgctgctct 

ccgtacctgc 

ccagtatgga 

ctccgttata 

ctgctctatg 

acagccactg 

cagaaagtca 

ctagggcaaa 

cttacctgtg 

ctcttctttc 

ggtggcattg 

tcaaaaatcc 

cgtctccttg 

cttctctgct 

tacacacagg 

ggcgtggagg 

aaaatatcct 

gctgcagtgt 

gtcttcagaa 

ctcagcatgg 

cagacgctgc 

cagtttttga 



gagtccagaa 

ccctcactcc 

cgccccggat 

tgctcctgcg 

gcgcctacgg 

tggggccgga 

cttactctta 

aacctgttgt 

cccagggact 

aaattgccta 

caagttactg 

tccttgtctc 

tttcagtggc 

accacattcc 

ttactgacac 

ctctaaatat 

tattgaaagt 

ggtctgtgtg 

gcctgtggga 

ccgtttcaac 

ggtcaacttg 

atatcatgga 

tcatctacat 

aacgctatgc 

tcactctaat 

tctatgccag 



cgtcctggcc 
agttggcgga 
ggatgtgccc 
gaccgctcgg 
cttcttcgcc 
caagaacctg 
cctggtgcta 
tctactgcag 
gctggccatt 
ttactcttat 
tcgaagtgcc 
agtggcaggc 
ttttgctgtg 
ttctacctgc 
cccagcttct 
ggaggagcct 
actatggaat 
gtgggccctc 
gaaagtgatg 
cttactgggt 
gggagaaatg 
cactgtgggt 
gttactcatc 
cccagtattt 
tgtggtagat 
ttattttgca 



ttacagggag 
ggaggagaag 
ggcccggtgt 
gtccgtcgcg 
agcctcaggc 
accgagaggg 
ctgtttcctg 
gggctcagcc 
caatttctag 
atctacagtg 
actttggtgg 
tggtcgctgt 
gcctggtttt 
cagagagtga 
aaccaccttc 
cccgtggagg 
gatttcctga 
tctacctgtg 
ccttctcgct 
gctgttgctg 
acattatctc 
aacatttggg 
acgatagcaa 
ggtgtaaata 
gccagtggcc 
ctcatcgctg 



aaggcgtcac 
gaaggggccg 
cccggcgggc 
aatgctggtt 
cgtccgagcc 
aggtcttcaa 
tgttccttgc 
ttattgttac 
aattttttta 
tggtggacct 
gctttacagt 
tcagcctgaa 
tacctatgcc 
atggcatcaa 
ctggctggga 
aaccggaacc 
tgtgctactc 
gctattttca 
atgctgctat 
tgtttgcagt 
tcttttctct 
tgtgctatgc 
cttttcaaat 
ccttcattgc 
ttggattaga 
tggtcttcct 



tcgcggttac 

ggccgggtcc 

ggcggcggcg 

cttgccgacc 

cttcctgacc 

tgaaatttat 

cacagactac 

atggtttatg 

tggcatcgcc 

gggcatgtac 

gggctctgtc 

tgtcacctct 

acagaagagc 

ggtacaaaat 

ggacattgag 

caagccagac 

ctctcgccct 

agttgtgaac 

ctataatggt 

tggttatata 

cctgattgct 

atcctatgtt 

tgctgcaaac 

cccggcactg 

aattaccact 

ggccagtggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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gcagtcagtg 

accacttcat 

tgcctggatg 

acttcacagc 

aggagccaga 

ttttttctga 

acatggtgtt 

gtatctactt 

gttgcaagtc 

tcttatttct 

atatttgttc 

ttattttaaa 

tctgtaaaat 

atttagttag 

taaggtacgc 

aagttcatat 

ccaaaataaa 

aatttgtttt 

ttcataaact 

tcctgcattt 

tgcttaaaat 

taattctgaa 

aaagctgact 

ttaacttttt 

gtcgacatac 

aggtcatttt 

gaagggaaag 

ttcttttttt 

gaaaagtgct 

ttcatgttac 

gagccttttc 

gaatgctgaa 

tttgttgtac 

ttttacagta 



ttatgaagaa 

aatatactgc 

tatttgattt 

agccacttga 

agtaaagctt 

ttcaaaagtg 

gtatacttca 

agccagatgc 

atgatggatg 

gtgttactca 

tggtttagtg 

aaaattcatc 

ttgtttagaa 

aaaataaggc 

ttgcttgtct 

ataggctaaa 

atgaaaaatc 

tcttatccaa 

gagcctcttt 

ttttacctta 

tgctagtgat 

atcagtctac 

ttgttgcctt 

tagaatgaca 

tcatttggtt 

ctattcacca 

ggaagtgata 

gctttgtttt 

gtttgaggga 

caaagcaata 

ttatataaga 

ctcaatgtgc 

tttttattca 

tattt 



atgtagaaag 

tgaagggctt 

tttaaagcgt 

ctaatacctt 

atttcatgga 

aacttgattt 

ttagcaagtg 

ttggcctggt 

gaaatatgtt 

tgtacattgg 

ttctgtattt 

atatgaatgt 

gtacgttttt 

aagttttgaa 

ttaaatcttc 

gatgtaacta 

tttttttaat 

gtaaagatgt 

tcaattattt 

aatctgcctt 

ttcattacca 

caaggggctg 

atcttgaaaa 

cagccgaaca 

taagcaggaa 

tcctttacta 

tcataaaagt 

gtttgtgcta 

ttttattaaa 

tttccctgga 

caagaatatg 

aagttgtact 

attctgtata 



ctggaagatc 
cttcttatag 
agacatatat 
gtgttccggg 
ttatttatga 
taaaaaccag 
agtttggtgt 
gggaccaggg 
ctaacagcat 
tctttctaca 
taaggtgtac 
tcttggttcc 
gcattattca 
gcctgctaaa 
aatatgaagg 
ggccatttgt 
aatttcatcc 
caataggaat 
caatgggaca 
tgtttctaaa 
agttacttga 
ctagtatatg 
gaatctagat 
gtgttaataa 
tcctagaagc 
ttaagggaaa 
agcaaccttc 
atttgggaat 
tcttttttta 
atttaattca 
ttcacatctt 
gtatgcagtt 
gattataaaa 



cacaatcaag 
caagaactct 
ttatgaatgt 
aataacatga 
gagctaattt 
tcaagagcaa 
tttataggtc 
ctccacagag 
ctgcctctat 
tagttattct 
gtatcacttc 
cattgtgacg 
tatgcttccc 
tgaagagact 
actattaatt 
atttgtattc 
ctatttatag 
tgcattagtc 
ggaactagga 
ggtagatcat 
aaaaatgttc 
tcagacatga 
aggtgctttt 
tagtgtgtca 
aaatggatgg 
ggaaaagaac 
attttacatt 
tgtgtactcc 
atggaatgtg 
aagtttgtgg 
ggtatgtggc 
ttgtaaataa 
ttatttttat 



ttctcaagta 
gcacagcaac 
gcatttcttg 
tactattcag 
aaggatgact 
tcaaagcagc 
acatatgtct 
gccacaaaat 
tcaatttaat 
atcactggca 
taattttaag 
attatttatt 
agagaagctc 
taagaaagct 
ccaagattaa 
ccttttattt 
tttttacatt 
caaggccttt 
tagatgtgat 
cttgaacatt 
tatatgcatt 
aaactatttt 
aactggggta 
agattgcaaa 
ggataagaat 
actagctaag 
ctgtctgttt 
gaaacaagta 
gtacaaattg 
catacaacct 
catatttata 
gtgaaaataa 
taaataaata 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3555 



<210> 81 
<211> 1293 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3438320CB1 



<400> 81 

atatgcccca 

ccctgatatt 

ataaacgctc 

ctgaggagca 

ccggggcctg 

ggaggtggcc 

cgacctaacg 
agttgcaggc 
catagatccg 
tccgcctcag 
gtcgtattat 
gctcacgtcg 
acaccgctat 
gaacctgaac 
tgactatctc 
tgatctcacg 
agtttggctt 
atctccactc 
tgccttcatt 
aagatcaagt 
tgtttgatag 
aatgatggaa 



cgcgggactc 
tatctcagtg 
tcccctcggc 
aaggagtaac 
gtggctgggc 
caacataaca 
tcattggcac 
caggatatca 
ctgaccggct 
ctgccctgtt 
gaggtggggc 
caggagcaca 
ctcccctata 
atggatttta 
agtatggacg 
gagttgtagg 
tttctgtgcc 
ctctccagga 
acagtttgct 
accttggttt 
ttaagagaga 
atgagatgtc 



atactacgtt 
gacggtggcc 
tcccggacgc 
caagagatcc 
cagacttgga 
ggcccgaaga 
aggaatacaa 
gccactggtt 
gcctgaggta 
cggactgggc 
ggctgtctgc 
cactggaggt 
actcacatgc 
ccctggaaga 
gtacacttca 
caaggagatg 
ttgaggaaaa 
gctagcctgt 
ctgagaaaat 
agggagatgt 
gtagttctac 
tgcaggagat 



tcccgtgaac 
ggaaaaggac 
gacggaggtc 
agtgaccgac 
gtattttcag 
cctctgggta 
ggggaacctg 
tgatccaaag 
ctgcaccccg 
caacgatttt 
caagacccgg 

gggggttctg 

tgccagctac 
gaatgggatc 
cacacctgca 
tacactcgtg 
gtggtggggc 
gcccttctga 
tagtgaatta 
agaagaggat 
aggggtgagg 

ggg 



acgtgcagtc 
aatggtttcc 
gtagtagtag 
agagcaagag 
cgtcgctatt 
tcttacctgg 
ctgctgaaac 
accagagaca 
cggggccgct 
gggaagccct 
agcatccgca 
gagtccatat 
acgtggaaat 
cgggatgagg 
atacttctgt 
tagactcaag 
cgaggggtgc 
agtgtaaagg 
atctttggga 
agtcagagtt 
gatggaagga 



caaaccccgc 
atgtcagcgg 
tgagtacgtg 
ccatgccgcg 
tcacgccggc 
gacgcgtgta 
ccatcgtgga 
tccgcaagca 
ttgtgcacgt 
ggtggcaggg 
tcattaacac 
gggaaatcct 
atgaagggaa 
aggaagaatt 
acttcaatga 
acgtatttcg 
ctggacccag 
ccctattccc 
atgatacaag 
caggcagaac 
cttttttggc 



60 

120 

180 

240 

300 

360 

420 

480 
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<210> 82 
<2X1> 1489 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3986488CB1 



<400> 82 

cggggctggc 

aaggcgctgc 

gcagccacca 

tttgataaga 

ataagtggat 

ttcaccattc 

aacatcatcc 

cttgctttta 

aatggagttt 

ttctacaatg 

ctgcaacagg 

ctttcctgtt 

tcagccctga 

atattaatcg 

atggcagggg 

gtgggtgaag 

gactgacttg 

tcggattcgt 

tacctcatat 

cctgtggaaa 

tgctcttcta 

gttgatgtaa 

ccaaagcaaa 

gttaactgcc 

aatcatcaaa 



gggcggcgct 
tgaccaccta 
taatgatacc 
aaattcctgt 
tatcaagcac 
cacttacctt 
tcagtgtctt 
acttagaagg 
ataccaaaca 
cctgcttcat 
ctactgaatt 
ttttggggtt 
cgacagcagt 
gtggagacta 
gcttgagata 
aaaacatctg 
tgactgcggg 
gacatccacc 
gcaggatgag 
gcaaatcttt 
gggacctatg 
tggtgactct 
gggccagttg 
agtaaagtta 
gtaaataaaa 



cttctacggg 
cggtttcccg 
atatgtgtcc 
aaagctgttt 
aagtaaatta 
acttctggaa 
tgccattatt 
ctatattttt 
gaaaatggac 
gattatccca 
caaccaatgg 
tctgctgatg 
ggttggagcc 
cattttctct 
ttccttttta 
tttggatttg 
ctgggggggc 
ccctgggcaa 
ccaatggcag 
tcaaaataag 
gggccttcgt 
tttctgatca 
tttcagttta 
tttgttagct 
aatcatttct 



acctgctcct 
tcaccaattt 
aagctaaaca 
cctctgcctc 
agcctaccga 
accatcatac 
ctcggggctt 
gtattcctga 
ccaaaggagc 
actcttatta 
aagaatgttg 
tactccacgg 
atcaagaatg 
ttgttaaact 
acactgagca 
aagagctaaa 
attcccagta 
gtgagagcat 
tctcaagaaa 
ccactgggac 
ggcatctctg 
gcaccttggc 
aacagacatg 
ttgatgaaag 
atgtaaaaaa 



tcctcatcgt 
tccttggaat 
aaatcattca 
tcctctacgt 
tgttcaccgt 
ttgggaagca 
tcatagcagc 
atgatatctt 
tagggaaata 
ttagtgtctc 
tgtttatcct 
ttctgtgcag 
tatccgttgc 
ttgtagggtt 
gccagttaaa 
gagtctgcag 
ggaatgtgaa 
ctgcaaaatg 
tgtactcggg 
tcggtaggtg 
tgctgtgtgc 
cgtgattccc 
tctttagtct 
ctatgttggt 
aaaaaaaaa 



gcttgtcaac 
tggacagatg 
cttccctgat 
tggaaaccac 
gctcaggaaa 
gtattcactc 
tgggtctgac 
cacagcagca 
cggagtactt 
cactggagac 
acagtttctt 
ctattacaat 
ctacattggg 
aaatatttgc 
acctaaacct 
caggattgga 
gccagaggtt 
caaagagaac 
cgacacctta 
gagccccagc 
tggggaggag 
aaggtcccag 
aataaaatta 
atctttccct 



60 

120 

180 

240 

300 

36a 

420 

480 

540 

600 

660 

720 

780 

840 
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<210> 83 
<211> 927 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4378816CB1 



<400> 83 

ctccctgctt 

gcgctgagtg 

tggcggcggg 

gggcgctcat 

tctacttgtc 

aagaatatgg 

aaattctgac 

aagaaaaata 

gctggatgac 

ggctgtactg 

tgctactgtg 

tgaagaagtt 

aatgaatcaa 

tgtattgttg 

tattgttgtt 

taaaatgttt 



tcctctgccg 
ggagctgggg 
ctgcgccctg 
ctggctctgc 
tttagtagga 
caaagctgat 
cgtcgccctg 
ttaccggcat 
cttcctccca 
ttggctttac 
gcagtcatgg 
tcagtgaact 
atctttttgt 
ctcctgaaca 
caattaaatg 
aattatacaa 



catggtcctg 
gccgaggctg 
ggcctgcgcc 
tacgacgcgc 
aacgttgcaa 
gcaagatggg 
gatgggtctc 
ttcctgcaga 
gagtggctca 
ctgttttttt 
ctagaactca 
ttcaaaacca 
ttggccaaaa 
acctgtttca 
gtgatatagg 
aaaaaaa 



ggccgttggc 
gcggttcgct 
tgggccgcgg 
tggtgcactt 
attccgatgg 
tttattttga 
tggcattgtt 
tcaccctgtg 
ccagaagccc 
ttaacggtgt 
agaaaatgca 
taaacaccat 
tgtaatacat 
aattggtttt 
gaaaagagaa 



gtcggaagcc 
gctgctgtgc 
gcagggggcg 
cgcgctggaa 
cttgattgct 
tccaaccatt 
cctcatttat 
cgtgtgcgag 
caacctcaac 
gtgggttctg 
tcagaaagaa 
tatctaactt 
tccagtctac 
aaggcgacca 
caaatttgaa 



tgaagcatgg 60 
gccgcgctgc 120 
gcggaccgcg 180 
ggcccttttg 240 
tctttatgga 300 
gtgtctgtgg 360 
gccatagtca 420 
ctgtatggct 480 
accagcaact 540 
atcccaggac 600 
accagttcag 660 
catgaaccag 720 
actttgtttt 780 
gttttcgttg 840 
tttgtaataa 900 
927 



<210> 84 
<211> 970 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 4797137CB1 

<400> 84 

tSStStt? SSSSfS? ?«ggaagcc gtggctgcgc tctcttccct ctgctgggcg 60 
Sglggtta tSfggaaaa ^EE^ "tccttgga gattcgtgca ga?gcccat? 120 
tcaltgacaa glSaltata SS?? tCaa9 J tgaaatgcac tttcaagtca acttcagacg 180 
caata^tca" ftatcagtct ftccaallacc tlt??^ cagcagcagc cacacagtat 240 
tttcctgggt tgaaaatet, ^f- 9 caaccacagc aggcacattt cgggatcgga 300 
taaaaaacaa r2£!EE* tacaaa 9ggg atgcatctat aagtataagc aaccctacca 360 
ttcccltaac ll^t agctgtgct * tgaagaatcc cccagatgtg caccataata 420 
cccttct?tc cltfcttatc g ^ a ' agaaa WBBttttgg caccatgct! tcctctgtgg tlo 
gaatggggag gaaggctgct laltlZZZZl cagccgtggt ggttgctctg ctgctggtga 540 
ctattS ► gggctgaaga agaggagcag gtctggctat aagaagtcat 600 

gtatcIo??a r^Sf? 30 actgatcagg aggaggaaga ggcgtgtatg gcgaggcttt 660 
latgaclcaf alaalafc^ "ggattcag a «atgaaga gacatattgl Egaalgtctg ?20 
ctgfcaglaa aaS^ ctaaagacag gaaacatccc attccactgg cagctaaagc 780 
taaaaaetlt 9 I ggcct 99 a «=c atagcgatgg acaatcctgg agatcatcag 840 

tcaalalatc SS^S** tatttatt 9 a ataaatgttc ttgttgtaEE tataaactg! 900 
tgafgaaltc tCataagaga "catgactt cccctttcaa tgaattatgc tgtaattgla 966 

970 

<210> 85 
<211> 594 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5470806CB1 

<400> 85 

gllglfggcc ctooaaa^l CCtg " cctc ctctgcctcg ctgtactgat atttgcgtct 60 
gctglclgag gcfa™^ ^? a9aatCC aagtgtcc t<= tgatggtcaa agtcctggat 120 
ggaacctggg facl^tt^ * g " gatgtg gccgtgaaag tgttcaaaaa gactgcagac 180 
alcSalg alllStcE ™E" gaag accgccga 9 c ctggagagct gcacgggctc 240 
tggaagfctf a ?oocatt a ^ ggaaggggtg tacagggtag aactggacac caaatcgtac 300 
glltc?ggtc atloccactf ™™ CCat gaatacgca * aggtggtttt cacagccaat 360 
ictgctgScq ?cS^ ^ aCCatCgCa 9 cc «g«ca gcccgtactc ctacagcacc 420 
ccalagcag? agcltcccat ttal^nt* gggacccagc "agtaggac caggatcttg 480 
caaetem™ iJ^^ ? tc 9 tac tgaa acagtgttct tgctctataa accgtgttag 540 
caactcggga agatgccgtg aaacgatctt attaaaccac ctgtgatgcc aaaa 594 

<210> 86 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 5473242CB1 

<400> 86 

tactaSaat f-^^" 9 gaaca ^ a = a = catggtgcac ctaactgatg ctgagaaggc 60 
cagtctgctg 2«SSt2S g3aaggtgaa ecctgttffaa attggcgctg agiccctSc 120 
ctctgtctct octlt^ 3 ^ c " ggaccca gaggtacttt tctaaatttg gggacctgtc 180 
cgccltcSt altaor 3 ^! gtaaccccca ggtgaaggcc catggcgaaa aggtgataaa 240 
tglactcSc ?atlf™ a aaca "5 g 5 a caacctcaag ggcacctttg ccagcctcag 300 
glttgtgatt JElEE^ tgcatgtgga tcctgagaac ttcaggctcc tgggcaataS 360 
rttccalaaa ota«^ S accacctggg caaggaattc accccgagtg cacaggctgc 420 
ttttcclgcf ct?«r?^ 9 gagtggc <= a ST tgccctggct cacaagtacc actaaacctc 480 
ttatclalat ™2£ " g tgcaat 5gtc aattgttccc aagagagctt ctgtcagttg 540 
tgftctgttg Igatacic" " tgaaaatc tgtectacf attaaagcat ttggttfaag 600 
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